CRYSTAL STRUCTURE OF WW DOMAINS AND METHODS OF USE THEREOF



The U.S. Government may have certain rights in this invention pursuant to 
Grant No. GM57533 awarded by the USPHS. 

FIELD OF THE INVENTION

The present invention relates to crystals of WW domains and more
particularly to the high resolution structure of the Pin1 WW domain obtained by X-ray
diffraction. In addition, the invention relates to methods of using the structure
coordinates of the Pin1 WW domain and mutants thereof to screen and design compounds
that bind to or interact with WW domains.

BACKGROUND 

The process of designing potent and specific inhibitors has improved with the
arrival of techniques for determining the three-dimensional structure of the enzyme to
be inhibited. Usually a three-dimensional model of an enzyme is produced by the
creation of a crystalline form of the purified enzyme, which is then subjected to X-ray
diffraction and analysis. While such procedures provide certain valuable information
that can be used to design inhibitors, they suffer from a lack of knowledge about the
amino acid residues critical for interaction with a substrate or a substrate mimic. In
order to address these limitations, enzymes have more recently been co-crystallized
with substrates, substrate mimics or known inhibitors of the enzyme's activity,
thereby allowing the important interactions to be determined (see, for example,
Mohammadi, et al., Science 276:955-960, 1997; Lee, et al., Biochemistry 36:13180-13186,
1997; Brzozowski, et al., Nature 389:753-758, 1997).

The peptidyl-prolyl cis-trans isomerases (PPIases), or rotamases, are a family
of enzymes important in protein folding, assembly and transport. They act as
catalysts to promote isomerization about the peptidyl-prolyl bond, which can have
profound effects on protein function.

PPIases are divided into three classes: cyclophilins, FK-506 binding proteins
(FKBPs) and the Pin1/parvulin class. While cyclophilins and FKBPs are
distinguished by their ability to bind the immunosuppressant molecules cyclosporin and
FK-506, respectively, the Pin1/parvulin class binds neither of these
immunosuppressants and is structurally unrelated to the other two classes. Known
members of the Pin1/parvulin class include Pins 1-3 (Lu, et al., Nature 380:544-547,
1996), Pin-L (Campbell, et al., Genomics 44:157-162, 1997), parvulin (Rahfeld, et
al., FEBS Letts 352:180-184, 1994), dodo (Maleszka, et al., Proc Natl Acad Sci USA
93:447-451, 1996) and Ess1/Ptf1 (Hanes, et al., Yeast 5:55-72, 1989; and Hani, et al.,
FEBS Letts 365:198-202, 1995).

Recent research suggests that members of the Pin1/parvulin class are essential
modulators of the cell cycle, and mitosis in particular. Lu, et al., Nature 380:544-547,
1996 (incorporated by reference herein) reports that depletion of Pin1/Ess1 (Ess1
being a protein structurally and functionally related to Pin1) in yeast or human cells
induces mitotic arrest followed by apoptosis, indicating that enzymes in this class
serve an essential function in cell division and proliferation.

The design of new, highly specific antimitotic agents represents an important
need in the pharmaceutical industry. Such agents can serve as effective
chemotherapeutic agents for the treatment of a variety of disorders characterized by
inappropriate cell proliferation, including cancer and infectious diseases. The
invention disclosed herein addresses this and related needs, as will become apparent
upon review of the specification and appended claims.

SUMMARY OF THE INVENTION 

A Pin1 WW domain crystal structure is provided. Methods of using the crystal
structure and atomic coordinates for the development of WW domain binding agents
are also provided. In addition, the disclosure provides computer programs on a computer
readable medium for use in developing WW domain binding agents.

BRIEF DESCRIPTION OF THE FIGURES 



Figure 1 collectively shows the overall architecture of human Pin1.
Figure 1A shows the ribbon representation of the Pin1-C-Terminal Domain (CTD)
peptide complex. Residues 39 to 50 are disordered. Apostrophes distinguish the WW
domain from the PPIase domain. The CTD peptide backbone is in the region labeled
as N, Pro3' and Pro6'. Residues of the CTD peptide are labeled with apostrophes.
Atoms of carbon, nitrogen, phosphorus, oxygen, and sulfur are depicted. Dotted lines
depict hydrogen bonds. Figure 1B shows a ribbon representation of the Pin1-PEG
complex.

Figure 2 collectively shows an enlarged view of the Pin1-CTD peptide
binding interface. Figure 2A shows a ribbon diagram of the Pin1 WW domain bound
to YpSPTpSPS depicted after a 90° rotation around a vertical axis from the view
shown in Figure 1A. This view looks onto the concave WW domain peptide-binding
surface opposite the PPIase domain. The carbon atoms of the CTD peptide
are in the foreground compared to the WW side chain atoms. The water molecule
mediating Tyr-23 / phosphate contacts is shown as a sphere. Hydrogen bonds are
shown as dotted spheres. Figure 2B shows the molecular surface representation of
the WW domain-peptide interface after a slight rotation around the vertical axis from
the view depicted in Figure 2A. The solvent-accessible surface for
the Pin1 WW domain residues was calculated in GRASP (Nicholls et al., Proteins
11:281-296, 1991), and the acidic and basic residues are depicted as the darker regions
adjacent to Lys6, Glu35, Arg36, Arg21, and Arg17.

Figure 3 shows a schematic and energetic view of the Pin1-YpSPTpSPS
complex. Pin1 residues are towards the bottom of the figure and are not covalently
linked to the larger molecule, which represents CTD. Residues participating in van
der Waals contacts are highlighted with gray and the extended van der Waals surfaces
appear as dotted gray curves. Hydrogen bonds are shown as dashed lines. In the case
of the S16H and W34H mutants, some of the apparent binding is likely being
contributed by the PPIase domain. Residues are given as single letter codes.

Figure 4 collectively shows representative WW domains. Figure 4A shows
the sequence alignment of 15 WW domains. The secondary structural elements are
those of Pin1. The numbers correspond to the first residue on each line for each of the
fifteen WW domains. Darker gray boxes delineate residues participating in
phosphate contacts. Shaded boxes corresponding to residues 23 and 24 define
residues participating in van der Waals contacts with Pro-3' of the CTD peptide.
Shaded residues 14 and 25 contribute to the van der Waals surface sequestering the
backbone of residues 4' to 6' as well as the Pro ring of Pro-6'. Light gray boxes
corresponding to residue 32 define residues participating in additional hydrogen
bonds with the CTD peptide. Residues participating in more than one class of
interactions are outlined and coded as described above. h is human, Sc is
Saccharomyces cerevisiae, d is Drosophila, m is mouse, and En is Emericella
nidulans. Figure 4B shows stereo views of the Cα traces of the Pin1-YpSPTpSPS
complex (top) and the YAP65-PPXY peptide complex (bottom) aligned with Pin1's
WW domain. The figure is labeled as in previous figures. The backbones of the WW
domains are now shown as coils. The arrow in the bottom panel depicts the
movement necessary to bring the PPXY-containing peptide into alignment with the
CTD peptide bound to Pin1. The numbering scheme for YAP65 refers to the human
sequence. The numbers in parentheses correspond to the alternate numbering scheme
used by Macias et al. (Nature 382:646-649, 1996).

Figure 5 is a block diagram of a computer system. 



DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, methods of identifying WW domain
binding agents are provided. Invention methods include defining an interaction site of a
WW domain based on a plurality of atomic coordinates of the WW domain, modeling a
potential binding agent that fits spatially into said interaction site, contacting said
potential binding agent with the WW domain in the presence of a WW domain substrate,
and determining the ability of the potential binding agent to compete with the WW
domain substrate for binding to the WW domain.

In another aspect, the invention provides a computer program on a computer 
readable medium, said computer program having instructions to cause a computer to 
model a potential binding agent that fits spatially into a WW domain interaction site 
defined by a plurality of atomic coordinates. 
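As a minimal sketch of what "defining an interaction site" from atomic coordinates could look like in practice, the following Python fragment collects the residues of a domain that have at least one atom within a distance cutoff of any atom of a bound peptide. The function name, the tuple layout, and the 4.0 Å cutoff are illustrative assumptions and are not taken from the specification.

```python
import math

def interaction_site(domain_atoms, ligand_atoms, cutoff=4.0):
    """Return sorted residue numbers having an atom within `cutoff` angstroms of the ligand.

    domain_atoms: iterable of (residue_number, x, y, z) tuples (hypothetical layout)
    ligand_atoms: iterable of (x, y, z) tuples
    cutoff:       contact distance in angstroms (assumed value, not from the source)
    """
    ligand = list(ligand_atoms)
    site = set()
    for residue_number, x, y, z in domain_atoms:
        for ligand_xyz in ligand:
            if math.dist((x, y, z), ligand_xyz) <= cutoff:
                site.add(residue_number)
                break  # one contact is enough to include the residue
    return sorted(site)

# Made-up coordinates, for illustration only.
domain = [(23, 0.0, 0.0, 0.0), (34, 3.0, 0.0, 0.0), (5, 20.0, 0.0, 0.0)]
peptide = [(1.0, 1.0, 0.0)]
print(interaction_site(domain, peptide))
```

A cutoff-based contact list is only one possible definition; a surface-based or energy-based definition of the site would serve the same purpose.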



WW domains
The WW domain is a 38-40 amino acid structural motif that functions as an
interaction module in a diverse set of signaling proteins. The modular nature of WW
domain interactions lends itself to classification into five distinct groups based upon
current understanding of their binding specificity. Group I WW domains, exemplified
by YAP65, recognize PPxY motifs; Group II WW domains, like FE65, bind the PPLP
motif. Group III WW domains, such as those found in a subset of FBPs, interact with
PGM motifs. A fourth group, represented by the WW domain of Pin1, specifically
interacts with phosphorylated serine or threonine residues amino-terminal to proline,
that is, phosphorylated Ser-Pro (P.Ser-Pro) or phosphorylated Thr-Pro (P.Thr-Pro)
motifs. Finally, the recently classified group V WW domains recognize PGR motifs.
Pin1 contains an N-terminal WW domain and a C-terminal peptidyl-prolyl cis-trans
isomerase domain connected by a flexible linker. Provided here is the functional and
structural basis for WW domain binding to phosphorylated protein substrates, based on
the energetic and structural analysis of a Pin1-phosphopeptide complex.

Pin1 has been implicated in the regulation of a diverse set of mitotically
phosphorylated proteins including Cdc25, Myt1, Wee1, Plk1, and Cdc27, and also
Tau. In addition, Pin1 and its homolog, Ess1p, have both been implicated in the
regulation of the C-terminal domain (CTD) of RNA Polymerase II (Pol II) and they can
both bind directly to the phosphorylated CTD. Phosphorylation of Ser-2' and/or Ser-5'
on the multiple YSPTSPS heptad repeats in the CTD of Pol II by cyclin-dependent
kinases (CDKs) converts Pol IIa to Pol IIo and initiates a cascade of transcriptional,
processing, and splicing events linked to the elongating polymerase. Structural and
functional analysis of Pin1 established that Pin1's PPIase domain (which belongs to the
parvulin family of PPIases) catalyzes phosphorylation-dependent peptide bond
isomerization in P.Ser-Pro segments. Finally, binding analyses demonstrate that the
Pin1 WW domain acts as the predominant P.Ser/P.Thr-Pro binding motif.

One aspect of the invention resides in obtaining crystals of the WW domain of
a Pin1 protein of sufficient quality to determine the three dimensional (tertiary)
structure of the protein by X-ray diffraction methods. The knowledge obtained
concerning Pin1 WW domain proteins can be used in the determination of the three
dimensional structure of the binding domain of other WW domain containing proteins
(e.g., Npw38 (Komuro et al., Nucleic Acids Res. 27(9):1957-65, 1999)). The binding
domain can also be predicted by various computer models. Based on the structural
coordinates of the WW domain (i.e., the three dimensional protein structure of the
binding domain), as described herein, small molecules which mimic the functional
binding of a WW domain protein to its ligands can be designed and synthesized.
Accordingly, in one embodiment, the invention provides a method of "rational" drug
design. Another approach to "rational" drug design is based on a lead compound that
is discovered using high throughput screens; the lead compound is further modified
based on a crystal structure of the binding regions of the molecule in question.
Accordingly, another aspect of the invention is to provide starting materials for the
rational design of drugs which prevent or mimic the action of a WW domain (e.g.,
Pin1 WW domain) protein binding to its ligand.

Pin1's amino acid sequence is known (Lu et al., supra). The WW domain of
Pin1 includes residues 1 to 39 and the PPIase domain includes residues 50-163 of
Pin1. The term "amino acids" means the L-isomers of the naturally occurring amino
acids or unnatural amino acids. The naturally occurring amino acids are glycine,
alanine, valine, leucine, isoleucine, serine, methionine, threonine, phenylalanine,
tyrosine, tryptophan, cysteine, proline, histidine, aspartic acid, asparagine, glutamic
acid, glutamine, γ-carboxyglutamic acid, arginine, ornithine and lysine. Unless
specifically indicated, all amino acids referred to in this application are in the L-form.

The term "unnatural amino acids" means amino acids that are not naturally
found in proteins. Examples of unnatural amino acids used herein include racemic
mixtures of selenocysteine and selenomethionine. In addition, unnatural amino acids
include the D or L forms of nor-leucine, para-nitrophenylalanine, homophenylalanine,
para-fluorophenylalanine, 3-amino-2-benzylpropionic acid, homoarginine, and
D-phenylalanine.

The term "positively charged amino acid" includes any naturally occurring or
unnatural amino acid having a positively charged side chain under normal
physiological conditions. Examples of positively charged naturally occurring amino
acids are arginine, lysine and histidine.

The term "negatively charged amino acid" includes any naturally occurring or
unnatural amino acid having a negatively charged side chain under normal
physiological conditions. Examples of negatively charged naturally occurring amino
acids are aspartic acid and glutamic acid.

The term "hydrophobic amino acid" means any amino acid having an
uncharged, nonpolar side chain that is relatively insoluble in water. Examples of
naturally occurring hydrophobic amino acids are alanine, leucine, isoleucine, valine,
proline, phenylalanine, tryptophan and methionine.

The term "hydrophilic amino acid" means any amino acid having an 
uncharged, polar side chain that is relatively soluble in water. Examples of naturally 
occurring hydrophilic amino acids are serine, threonine, tyrosine, asparagine, 
glutamine, and cysteine. 
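The four side-chain classes defined above can be captured in a small lookup, sketched below in Python. The set contents are copied from the example lists in the preceding definitions; the function name `classify` and the "unclassified" fallback (used for residues, such as glycine, that the definitions do not place in any class) are illustrative assumptions.

```python
# Example members of each class, as listed in the definitions above.
POSITIVELY_CHARGED = {"arginine", "lysine", "histidine"}
NEGATIVELY_CHARGED = {"aspartic acid", "glutamic acid"}
HYDROPHOBIC = {"alanine", "leucine", "isoleucine", "valine",
               "proline", "phenylalanine", "tryptophan", "methionine"}
HYDROPHILIC = {"serine", "threonine", "tyrosine", "asparagine",
               "glutamine", "cysteine"}

CLASSES = {
    "positively charged": POSITIVELY_CHARGED,
    "negatively charged": NEGATIVELY_CHARGED,
    "hydrophobic": HYDROPHOBIC,
    "hydrophilic": HYDROPHILIC,
}

def classify(amino_acid):
    """Return the side-chain class of a naturally occurring amino acid name."""
    name = amino_acid.strip().lower()
    for label, members in CLASSES.items():
        if name in members:
            return label
    return "unclassified"  # assumption: residues not listed above get no class

for residue in ("Lysine", "glutamic acid", "Tryptophan", "serine", "glycine"):
    print(residue, "->", classify(residue))
```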

The term "crystal structure coordinates" refers to mathematical coordinates
derived from mathematical equations related to the patterns obtained on diffraction of
a monochromatic beam of X-rays by the atoms (scattering centers) of a protein
molecule in crystal form. The diffraction data are used to calculate an electron
density map of the repeating unit of the crystal. The electron density maps are used to
establish the positions of the individual atoms within the unit cell of the crystal. The
crystal structure coordinates of the Pin1 protein binding domain (e.g., the WW
domain) are obtained from a Pin1 protein crystal having orthorhombic space group
symmetry P2₁2₁2₁ with a = 35.27 Å, b = 43.90 Å, c = 124.66 Å, α = β = γ = 90°. The
coordinates of the Pin1 protein binding domain can also be obtained by means of
computational analysis.

The term "selenomethionine substitution" refers to the method of producing a
chemically modified form of a crystal of Pin1. The Pin1 protein is expressed by
bacteria in media that are depleted in methionine and supplemented with
selenomethionine. Selenium is thereby incorporated into the crystal in place of
methionine sulfurs. The location(s) of selenium are determined by X-ray diffraction
analysis of the crystal. This information is used to generate the phase information
used to construct a three-dimensional structure of the protein.

The term "heavy atom derivatization" refers to the method of producing a
chemically modified form of the crystal of Pin1. A crystal is soaked in a solution
containing heavy metal atom salts or organometallic compounds, which can diffuse
through the crystal and bind to the surface of the protein. The location(s) of the bound
heavy metal atom(s) are determined by X-ray diffraction analysis of the soaked
crystal. This information is used to generate the phase information used to construct a
three-dimensional structure of the protein.

The term "unit cell" refers to the basic parallelepiped-shaped block. The entire
volume of a crystal may be constructed by regular assembly of such blocks.
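For orientation, the volume of such a block follows directly from the cell edges and angles. The Python sketch below uses the general parallelepiped formula, which reduces to a·b·c when all three angles are 90°, and applies it to the orthorhombic cell dimensions given in the definition of "crystal structure coordinates" (a = 35.27 Å, b = 43.90 Å, c = 124.66 Å). The function name is an illustrative choice, not part of the specification.

```python
import math

def unit_cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Volume of a unit cell from edge lengths (angstroms) and angles (degrees)."""
    cos_a, cos_b, cos_g = (math.cos(math.radians(angle))
                           for angle in (alpha, beta, gamma))
    # General parallelepiped volume; equals a*b*c for an orthorhombic cell.
    factor = math.sqrt(1.0 - cos_a**2 - cos_b**2 - cos_g**2
                       + 2.0 * cos_a * cos_b * cos_g)
    return a * b * c * factor

volume = unit_cell_volume(35.27, 43.90, 124.66)
print(f"Unit cell volume: {volume:.1f} cubic angstroms")
```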

The term "space group" refers to the arrangement of symmetry elements of a
crystal.

The term "molecular replacement" refers to a method that involves generating
a preliminary model of a Pin1 crystal whose structure coordinates are not known, by
orienting and positioning a molecule whose structure coordinates are known. Phases
are then calculated from this model and combined with observed amplitudes to give
an approximate Fourier synthesis of the structure whose coordinates are unknown.
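The "orienting and positioning" step amounts to applying a rigid-body rotation and translation to the known coordinates. The Python sketch below shows only that placement operation, under the simplifying assumption of a rotation about a single axis; it does not perform the rotation and translation searches against diffraction data that a real molecular replacement program carries out, and the function names are hypothetical.

```python
import math

def rotation_about_z(angle_degrees):
    """3x3 rotation matrix for a rotation about the z axis."""
    theta = math.radians(angle_degrees)
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0],
            [s, c, 0.0],
            [0.0, 0.0, 1.0]]

def place_model(coords, rotation, translation):
    """Rotate, then translate, each (x, y, z) coordinate of a search model."""
    placed = []
    for x, y, z in coords:
        rotated = [row[0] * x + row[1] * y + row[2] * z for row in rotation]
        placed.append(tuple(r + t for r, t in zip(rotated, translation)))
    return placed

# Illustrative input: one atom on the x axis, rotated 90 degrees and shifted along z.
model = [(1.0, 0.0, 0.0)]
print(place_model(model, rotation_about_z(90.0), (0.0, 0.0, 1.0)))
```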

The crystal structure coordinates of a Pin1 protein and its WW domain can be
used to design compounds that bind to the protein and alter its physical or
physiological properties in a variety of ways. The structure coordinates of the protein
can also be used to computationally screen small molecule databases for compounds
that bind to the protein to develop competitive, uncompetitive and non-competitive
inhibitors of the Pin1 WW domain.

A "competitive" inhibitor is one that inhibits a WW domain activity (e.g., a
Pin1 WW domain activity) by binding to the same kinetic form of the WW domain
as its substrate binds, thus directly competing with the substrate for the active site of
the WW domain. Competitive inhibition can be reversed completely by increasing
the substrate concentration.



An "uncompetitive" inhibitor is one that inhibits WW domain activity (e.g., a
Pin1 WW domain activity) by binding to a different kinetic form of the active site
than does the substrate. Such inhibitors bind to a WW domain already bound with the
substrate and not to the free enzyme. Uncompetitive inhibition cannot be reversed
completely by increasing the substrate concentration.

A "non-competitive" inhibitor is one that can bind to either the free or 
substrate bound form of a WW domain. 

Those of skill in the art may identify inhibitors as competitive, uncompetitive
or non-competitive, by computer fitting enzyme kinetic data using standard equations
according to Segel, I. H., Enzyme Kinetics, J. Wiley & Sons, (1975). It should also be
understood that uncompetitive or non-competitive inhibitors according to this
invention may bind to an accessory binding site.
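The three inhibitor classes can be told apart by how the rate responds to substrate concentration. The Python sketch below writes out the textbook Michaelis-Menten rate laws for each case; it is offered as an illustration of the kind of standard equation that would be fitted, not as the specific fitting procedure contemplated here. The parameter names and the use of a single inhibition constant `ki` in the non-competitive case (the "pure" non-competitive form) are assumptions.

```python
def rate_competitive(s, i, vmax, km, ki):
    """Inhibitor binds only the free form: apparent Km rises, Vmax unchanged."""
    return vmax * s / (km * (1.0 + i / ki) + s)

def rate_uncompetitive(s, i, vmax, km, ki):
    """Inhibitor binds only the substrate-bound form: Vmax and Km both fall."""
    return vmax * s / (km + s * (1.0 + i / ki))

def rate_noncompetitive(s, i, vmax, km, ki):
    """Inhibitor binds free and substrate-bound forms equally: Vmax falls."""
    return vmax * s / ((km + s) * (1.0 + i / ki))

# Illustrative values in arbitrary units: at saturating substrate only the
# competitive case recovers the uninhibited maximal rate.
vmax, km, ki, inhibitor = 1.0, 1.0, 1.0, 1.0
for s in (1.0, 1e6):
    print(s,
          round(rate_competitive(s, inhibitor, vmax, km, ki), 3),
          round(rate_uncompetitive(s, inhibitor, vmax, km, ki), 3),
          round(rate_noncompetitive(s, inhibitor, vmax, km, ki), 3))
```

This mirrors the definitions above: competitive inhibition is overcome by excess substrate, whereas uncompetitive inhibition is not.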

Methods of using crystal structure data to design inhibitors of enzyme activity
are known in the art. Thus, the crystal structure data provided herein can be used in
the design of new or improved enzymatic inhibitors. For example, the Pin1 WW
domain coordinates, provided herein, can be superimposed onto other available
coordinates of similar enzymes which have inhibitors bound to them to give an
approximation of the way these and related inhibitors might bind to Pin1 WW
domains. Alternatively, computer programs employed in the practice of rational drug
design can be used to identify compounds that reproduce interaction characteristics
similar to those found between the Pin1 WW domain and the co-crystallized substrate
mimic. Furthermore, detailed knowledge of the nature of binding site interactions
allows for the modification of compounds to alter or improve solubility,
pharmacokinetics, etc. without affecting binding activity.



Computer programs are widely available that are capable of carrying out the
activities necessary to design compounds using the crystal structure information
provided herein. Examples include, but are not limited to, the computer programs
listed below:

Catalyst Databases™ - an information retrieval program accessing chemical databases
such as BioByte Master File, Derwent WDI and ACD;

Catalyst/HYPO™ - generates models of compounds and hypotheses to explain
variations of activity with the structure of drug candidates;

Ludi™ - fits molecules into the active site of a protein by identifying and matching
complementary polar and hydrophobic groups;

Leapfrog™ - "grows" new ligands using an algorithm with parameters under the
control of the user.

In addition, various general purpose machines may be used with programs
written in accordance with the teachings herein, or it may be more convenient to
construct more specialized apparatus to perform the operations. However, preferably the
embodiment is implemented in one or more computer programs executing on
programmable systems each comprising at least one processor, at least one data storage
system (including volatile and non-volatile memory and/or storage elements), at least
one input device, and at least one output device. The program is executed on the
processor to perform the functions described herein.

Each such program may be implemented in any desired computer language
(including machine, assembly, high level procedural, or object oriented programming
languages) to communicate with a computer system. In any case, the language may be a
compiled or interpreted language. The computer program will typically be stored on a
storage medium or device (e.g., ROM, CD-ROM, or magnetic or optical media) readable
by a general or special purpose programmable computer, for configuring and operating
the computer when the storage medium or device is read by the computer to perform the
procedures described herein. The system may also be considered to be implemented as a
computer-readable storage medium, configured with a computer program, where the
storage medium so configured causes a computer to operate in a specific and predefined
manner to perform the functions described herein.

Embodiments of the invention include systems (e.g., internet based systems),
particularly computer systems which store and manipulate the coordinate and sequence
information described herein. One example of a computer system 100 is illustrated in
block diagram form in Figure 5. As used herein, "a computer system" refers to the
hardware components, software components, and data storage components used to
analyze the coordinates and sequences set forth in Table 1. The computer system 100
typically includes a processor for processing, accessing and manipulating the sequence
data. The processor 105 can be any well-known type of central processing unit, such as,
for example, the Pentium III from Intel Corporation, or a similar processor from Sun,
Motorola, Compaq, AMD or International Business Machines.

Typically the computer system 100 is a general purpose system that comprises
the processor 105 and one or more internal data storage components 110 for storing data,
and one or more data retrieving devices for retrieving the data stored on the data storage
components. A skilled artisan can readily appreciate that any one of the currently
available computer systems is suitable.

In one particular embodiment, the computer system 100 includes a processor 105
connected to a bus which is connected to a main memory 115 (preferably implemented
as RAM) and one or more internal data storage devices 110, such as a hard drive and/or
other computer readable media having data recorded thereon. In some embodiments, the
computer system 100 further includes one or more data retrieving devices 118 for reading
the data stored on the internal data storage devices 110.

The data retrieving device 118 may represent, for example, a floppy disk drive, a
compact disk drive, a magnetic tape drive, or a modem capable of connection to a
remote data storage system (e.g., via the internet) etc. In some embodiments, the internal
data storage device 110 is a removable computer readable medium such as a floppy disk,
a compact disk, a magnetic tape, etc. containing control logic and/or data recorded
thereon. The computer system 100 may advantageously include or be programmed by
appropriate software for reading the control logic and/or the data from the data storage
component once inserted in the data retrieving device.

The computer system 100 includes a display 120 which is used to display output
to a computer user. It should also be noted that the computer system 100 can be linked
to other computer systems 125a-c in a network or wide area network to provide
centralized access to the computer system 100.

Software for accessing and processing the coordinates and sequences of Table 1
(such as search tools, compare tools, and modeling tools etc.) may reside in main
memory 115 during execution.
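As a hedged illustration of such software, the Python sketch below parses rows laid out in the column order of Table 1 (atom number, atom type, residue, residue number, X, Y, Z, occupancy, B) and computes an interatomic distance. It assumes each row is available as one whitespace-separated line of text; the `AtomRecord` type and the function names are hypothetical. The two sample rows are atoms 3 and 4 of Table 1.

```python
import math
from typing import NamedTuple

class AtomRecord(NamedTuple):
    serial: int
    atom_type: str
    residue: str
    residue_number: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float

def parse_row(line):
    """Parse one whitespace-separated Table 1 row into an AtomRecord."""
    serial, atom_type, residue, number, x, y, z, occ, b = line.split()
    return AtomRecord(int(serial), atom_type, residue, int(number),
                      float(x), float(y), float(z), float(occ), float(b))

def distance(first, second):
    """Euclidean distance in angstroms between two atom records."""
    return math.dist((first.x, first.y, first.z),
                     (second.x, second.y, second.z))

# Atoms 3 (N) and 4 (CA) of GLY 1, as listed in Table 1.
n_atom = parse_row("3 N GLY 1 -1.549 6.552 75.579 1.00 35.60")
ca_atom = parse_row("4 CA GLY 1 -.294 5.819 75.824 1.00 35.38")
print(f"N-CA distance: {distance(n_atom, ca_atom):.3f} angstroms")
```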

For the first time, the present invention permits the use of molecular design
techniques to design, select and synthesize chemical entities and compounds, including
inhibitory compounds, capable of binding to the active site or accessory binding site of a
WW domain (e.g., a Pin1 WW domain), in whole or in part.

One approach enabled by this invention is to use the structure coordinates set
forth in Table 1 to design compounds that bind to the enzyme and alter the physical
properties of the compounds in different ways, e.g., solubility. For example, this
invention enables the design of compounds that act as competitive inhibitors of the WW
domains by binding to all or a portion of the active site of a WW domain. This
invention also enables the design of compounds that act as uncompetitive inhibitors of
the WW domain (e.g., the WW domain of Pin1). These inhibitors may bind to all or a
portion of the active site of a WW domain. Similarly, non-competitive inhibitors that
bind to and inhibit a WW domain whether or not it is bound to another chemical entity
may be designed using the structure coordinates of the invention as set forth in Table 1.

In another approach a Pin1 WW domain crystal is probed with molecules
composed of a variety of different chemical entities to determine optimal sites for
interaction between candidate binding molecules (e.g., inhibitors) and the WW domain
active site.

In another embodiment, an approach made possible and enabled by this
invention is to computationally screen small molecule databases for chemical entities or
compounds that can bind in whole, or in part, to the WW domain. In this screening, the
quality of fit of such entities or compounds to the binding site may be judged either by
shape complementarity or by estimated interaction energy (Meng, E. C. et al., J. Comp.
Chem., 13, pp. 505-524 (1992)).
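To make "estimated interaction energy" concrete, the Python sketch below scores a candidate pose by summing a 12-6 Lennard-Jones term over ligand-site atom pairs. This is a generic illustration, not the scoring scheme of the cited reference; the well depth `epsilon`, the contact distance `sigma`, the pair cutoff, and the function names are all assumed values chosen for demonstration.

```python
import math

def lennard_jones(r, epsilon=0.2, sigma=3.5):
    """12-6 Lennard-Jones energy at separation r (parameters are illustrative)."""
    ratio6 = (sigma / r) ** 6
    return 4.0 * epsilon * (ratio6 * ratio6 - ratio6)

def interaction_score(ligand_atoms, site_atoms, cutoff=8.0):
    """Sum pairwise energies for all ligand-site atom pairs within the cutoff.

    Both arguments are lists of (x, y, z) tuples; lower scores indicate a
    more favorable estimated fit under this simple model.
    """
    total = 0.0
    for ligand_xyz in ligand_atoms:
        for site_xyz in site_atoms:
            r = math.dist(ligand_xyz, site_xyz)
            if 0.0 < r <= cutoff:
                total += lennard_jones(r)
    return total

# Made-up coordinates: one pose near the optimal contact distance, one clashing.
site = [(0.0, 0.0, 0.0)]
good_pose = [(3.9, 0.0, 0.0)]
clashing_pose = [(2.0, 0.0, 0.0)]
print(interaction_score(good_pose, site), interaction_score(clashing_pose, site))
```

Ranking database entries by such a score, or by a shape-complementarity measure instead, gives the ordering from which candidates would be selected for experimental testing.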

A number of WW domains exist, many of which have similar functional activity.
However, many WW domains may crystallize in more than one crystal form. Thus, the
structure coordinates of the Pin1 WW domain, or portions thereof, as provided by this
invention are particularly useful to solve the structure of other crystal forms of WW
domains. They may also be used to solve the structure of a WW domain or Pin1 WW
domain mutants, WW domain co-complexes, or the crystalline form of any other protein
with significant amino acid sequence homology to any WW domain.

One method that may be employed for this purpose is molecular replacement. In
this method, the unknown crystal structure, whether it is another WW domain crystal
form, a WW domain or Pin1 WW domain mutant, or a WW domain co-complex, or the
crystal of some other protein with significant amino acid sequence homology to any
WW domain, may be determined using the structure coordinates as provided in Table 1.
This method will provide an accurate structural form for the unknown crystal more
quickly and efficiently than attempting to determine such information ab initio. Those
of skill in the art understand that a set of structure coordinates determined by X-ray
crystallography is not without standard error.
TABLE 1

Atom  Atom Type  Residue   #        X        Y        Z   OCC      B
   1  C          GLY       1     .297    6.261   77.129  1.00  34.99
   2  O          GLY       1     .701    5.443   77.948  1.00  35.67
   3  N          GLY       1   -1.549    6.552   75.579  1.00  35.60
   4  CA         GLY       1    -.294    5.819   75.824  1.00  35.38
   5  N          SER       2     .327    7.566   77.321  1.00  34.27
   6  CA         SER       2     .851    8.106   78.547  1.00  33.27
   7  CB         SER       2    1.393    9.518   78.304  1.00  33.43
   8  OG         SER       2     .397   10.429   77.856  1.00  34.33
   9  C          SER       2    -.346    8.118   79.496  1.00  32.42
  10  O          SER       2   -1.486    8.145   79.045  1.00  32.01
  11  N          HIS       3    -.093    8.099   80.798  1.00  31.53
  12  CA         HIS       3   -1.153    8.083   81.796  1.00  30.70
  13  CB         HIS       3    -.530    8.274   83.186  1.00  30.18
  14  CG         HIS       3   -1.523    8.316   84.304  1.00  29.99
  15  CD2        HIS       3   -2.043    7.337   85.077  1.00  29.88
  16  ND1        HIS       3   -2.102    9.493   84.732  1.00  29.83
  17  CE1        HIS       3   -2.936    9.228   85.729  1.00  29.76
  18  NE2        HIS       3   -2.915    7.930   85.954  1.00  29.65
  19  C          HIS       3   -2.261    9.111   81.546  1.00  30.30
  20  O          HIS       3   -3.428    8.818   81.767  1.00  30.00
  21  N          GLY       4   -1.899   10.292   81.053  1.00  30.06
  22  CA         GLY       4   -2.891   11.330   80.812  1.00  30.26
  23  C          GLY       4   -3.491   11.392   79.417  1.00  30.62
  24  O          GLY       4   -3.962   12.445   78.990  1.00  30.36
  25  N          MET       5   -3.484   10.272   78.702  1.00  31.06
  26  CA         MET       5   -4.029   10.239   77.349  1.00  32.14
  27  CB         MET       5   -2.900   10.330   76.320  1.00  33.56
  28  CG         MET       5   -1.877   11.379   76.665  1.00  35.79
  29  SD         MET       5   -1.479   12.502   75.352  1.00  39.48
  30  CE         MET       5   -2.891   13.512   75.389  1.00  38.05
  31  C          MET       5   -4.816    8.967   77.107  1.00  32.01
  32  O          MET       5   -4.592    7.954   77.770  1.00  31.57
  33  N          ALA       6   -5.743    9.024   76.156  1.00  32.09
  34  CA         ALA       6   -6.544    7.858   75.823  1.00  32.75
  35  CB         ALA       6   -7.620    8.224   74.808  1.00  32.26
  36  C          ALA       6   -5.593    6.840   75.226  1.00  33.51
  37  O          ALA       6   -4.579    7.208   74.625  1.00  32.76
  38  N          ASP       7   -5.900    5.561   75.408  1.00  34.83
  39  CA         ASP       7   -5.050    4.523   74.847  1.00  36.77
  40  CB         ASP       7   -5.191    3.201   75.574  1.00  36.60
  41  CG         ASP       7   -4.345    3.133   76.794  1.00  36.46
  42  OD1        ASP       7   -3.300    3.815   76.824  1.00  36.44
  43  OD2        ASP       7   -4.722    2.392   77.717  1.00  36.77
  44  C          ASP       7   -5.426    4.277   73.431  1.00  38.24
  45  O          ASP       7   -6.399    4.833   72.926  1.00  38.39
  46  N          GLU       8   -4.675    3.410   72.778  1.00  39.91
  47  CA         GLU       8   -5.037    3.176   71.420  1.00  41.61
  48  CB         GLU       8   -5.005    4.516   70.665  1.00  42.47
  49  CG         GLU       8   -3.630    5.196   70.661  1.00  43.94
  50  CD         GLU       8   -3.700    6.679   70.399  1.00  44.72
  51  OE1        GLU       8   -3.249    7.133   69.317  1.00  45.25
  52  OE2        GLU       8   -4.209    7.410   71.267  1.00  45.66
  53  C          GLU       8   -4.350    2.129   70.591  1.00  42.09
  54  O          GLU       8   -3.395    1.444   70.989  1.00  42.27
  55  N          GLU       9   -4.940    2.032   69.418  1.00  42.95
  56  CA         GLU       9   -4.606    1.219   68.258  1.00  43.60
  57  CB         GLU       9   -3.972    2.124   67.200  1.00  44.31
  58  CG         GLU       9   -3.671    3.542   67.569  1.00  45.81
  59  CD         GLU       9   -4.496    4.518   66.738  1.00  46.50
  60  OE1        GLU       9   -5.535    4.977   67.255  1.00  47.27
  61  OE2        GLU       9   -4.138    4.816   65.573  1.00  47.08
  62  C          GLU       9   -3.933    -.102   68.005  1.00  43.44
  63  O          GLU       9   -3.420    -.838   68.845  1.00  43.85
  64  N          LYS      10   -3.912    -.276   66.690  1.00  43.38
  65  CA         LYS      10   -3.326   -1.359   65.963  1.00  42.60
  66  CB         LYS      10   -4.363   -2.078   65.091  1.00  43.20
  67  CG         LYS      10   -5.105   -3.248   65.688  1.00  43.83
  68  CD         LYS      10   -6.580   -2.891   65.718  1.00  44.18
  69  CE         LYS      10   -7.473   -3.925   65.065  1.00  44.53


70 


NZ 


LYS 


10 


-8.832 


-3 . 339 


64 . 817 


1 . 00 


44 . 81 


71 


C 


LYS 


10 


-2 .521 


-.539 


64.995 


1.00 


41.77 


72 


O 


LYS 


10 


-1.396 


- . 087 


65 .248 


1 . 00 


41.98 


73 


N 


LEU 


11 


-3 .210 


- .290 


63 .894 


1. 00 


40 . 35 


74 


CA 


LEU 


11 


-2 . 678 


.402 


62 .758 


1.00 


38. 97 


75 


CG 


LEU 


11 


-2 . 843 


2 .889 


63.092 


1.00 


38. 96 


76 


CD1 


LEU 


11 


-4 . 128 


2 .512 


63.814 


1.00 


38. 82 


77 


CD 2 


LEU 


11 


-2 .222 


4 . 141 


63 .685 


1 . 00 


38 . 60 


78 


C 


LEU 


11 


-1. 778 


- . 681 


62 .212 


1.00 


37 .68 


79 


0 


LEU 


11 


- . 725 


- . 986 


62 .778 


1.00 


37 . 95 


80 


CB 


LEU 


11 


-1. 911 


1. 672 


63 .158 


1.00 


38 . 94 


81 


N 


PRO 


12 


-2 .216 


-1.315 


61.126 


1.00 


36.04 


82 


CD 


PRO 


12 


-3 . 360 


- . 954 


60.273 


1 . 00 


35.90 


83 


CA 


PRO 


12 


-1.421 


-2 .381 


60.519 


1 . 00 


34 .36 


84 


CB 


PRO 


12 


-2 .183 


-2 .692 


59.236 


1 . 00 


34 .88 


85 


CG 


PRO 


12 


-3 .597 


-2.239 


59.553 


1.00 


35.30 


86 


C 


PRO 


12 


- . 049 


-1.801 


60.240 


1.00 


32 .67 


87 


0 


PRO 


12 


. 151 


- .586 


60 . 330 


1. 00 


32 .19 


88 


N 


PRO 


13 


. 929 


-2 .652 


59 . 924 


1.00 


31.08 


89 


CD 


PRO 


13 


. 963 


-4 . 125 


59 . 872 


1.00 


30 . 89 


90 


CA 


PRO 


13 


2 .248 


-2 . 084 


59.654 


1.00 


29.59 


91 


CB 


PRO 


13 


3 .042 


-3 .293 


59.165 


1.00 


30 . 16 


92 


CG 


PRO 


13 


2 .438 


-4 .419 


59 . 971 


1. 00 


30.57 


93 


C 


PRO 


13 


2.140 


-.985 


58.593 


1.00 


28.22 


94 


0 


PRO 


13 


1.302 


-1. 062 


57.689 


1.00 


27.79 


95 


N 


GLY 


14 


2 .966 


. 047 


58.737 


1.00 


26 . 74 


96 


CA 


GLY 


14 


2 .970 


1. 140 


57.783 


1. 00 


25 .25 


97 


C 


GLY 


14 


2 . 066 


2 .317 


58 . 092 


1. 00 


24 .24 


98 


0 


GLY 


14 


2 .264 


3 .403 


57.553 


1. 00 


23.92 


99 


N 


TRP 


15 


1.081 


2.124 


58.960 


1. 00 


23 .51 


100 


CA 


TRP 


15 


.160 


3 .205 


59.286 


1. 00 


22 . 91 


101 


CB 


TRP 


15 


-1.275 


2 .679 


59.368 


1. 00 


22 .23 


102 


CG 


TRP 


15 


-1 . 845 


2 .209 


58 . 065 


1 . 00 


21 . 67 


103 


CD 2 


TRP 


15 


-2 .524 


3.010 


57 .092 


1.00 


21.24 


104 


CE2 


TRP 


15 


-2.898 


2.152 


56.032 


1.00 


21.30 


105 


CE3 


TRP 


15 


-2 . 853 


4.371 


57 .011 


1 . 00 


20.98 


106 


CD1 


TRP 


15 


-1.832 


.935 


57 .573 


1 . 00 


21.71 


107 


NE1 


TRP 


15 


-2 .462 


.893 


56 .352 


1.00 


21.40 



16 



 108  CZ2  TRP   15    -3.588    2.609   54.902  1.00  21.06
 109  CZ3  TRP   15    -3.540    4.829   55.884  1.00  21.18
 110  CH2  TRP   15    -3.898    3.946   54.845  1.00  21.04
 111  C    TRP   15      .470    3.954   60.575  1.00  23.13
 112  O    TRP   15     1.032    3.402   61.519  1.00  22.54
 113  N    GLU   16      .085    5.223   60.596  1.00  23.37
 114  CA   GLU   16      .267    6.065   61.763  1.00  24.62
 115  CB   GLU   16     1.619    6.790   61.698  1.00  24.57
 116  CG   GLU   16     1.788    7.743   60.532  1.00  24.62
 117  CD   GLU   16     3.200    8.311   60.448  1.00  24.61
 118  OE1  GLU   16     4.114    7.598   59.988  1.00  24.24
 119  OE2  GLU   16     3.395    9.470   60.857  1.00  24.99
 120  C    GLU   16     -.890    7.063   61.812  1.00  25.27
 121  O    GLU   16    -1.487    7.384   60.784  1.00  24.68
 122  N    LYS   17    -1.224    7.526   63.012  1.00  26.55
 123  CA   LYS   17    -2.307    8.486   63.173  1.00  28.35
 124  CB   LYS   17    -3.042    8.285   64.502  1.00  29.42
 125  CG   LYS   17    -4.042    9.410   64.769  1.00  31.85
 126  CD   LYS   17    -4.548    9.476   66.204  1.00  33.36
 127  CE   LYS   17    -5.308    8.228   66.599  1.00  34.45
 128  NZ   LYS   17    -4.377    7.166   67.041  1.00  36.19
 129  C    LYS   17    -1.781    9.910   63.142  1.00  29.19
 130  O    LYS   17     -.680   10.188   63.617  1.00  28.99
 131  N    ARG   18    -2.577   10.818   62.593  1.00  30.37
 132  CA   ARG   18    -2.184   12.215   62.536  1.00  31.87
 133  CB   ARG   18    -1.435   12.482   61.233  1.00  32.64
 134  CG   ARG   18     -.238   11.563   61.069  1.00  34.24
 135  CD   ARG   18      .498   11.835   59.791  1.00  35.78
 136  NE   ARG   18     1.196   13.109   59.859  1.00  37.48
 137  CZ   ARG   18     2.517   13.230   59.884  1.00  37.91
 138  NH1  ARG   18     3.066   14.428   59.952  1.00  38.43
 139  NH2  ARG   18     3.291   12.154   59.830  1.00  38.32
 140  C    ARG   18    -3.394   13.127   62.672  1.00  32.46
 141  O    ARG   18    -4.531   12.658   62.697  1.00  32.20
 142  N    MET   19    -3.146   14.428   62.777  1.00  33.73
 143  CA   MET   19    -4.224   15.396   62.922  1.00  35.12
 144  CB   MET   19    -4.089   16.168   64.236  1.00  36.75
 145  CG   MET   19    -4.642   15.457   65.452  1.00  39.07
 146  SD   MET   19    -5.206   16.611   66.735  1.00  41.46
 147  CE   MET   19    -5.153   18.203   65.865  1.00  41.00
 148  C    MET   19    -4.293   16.405   61.790  1.00  35.20
 149  O    MET   19    -3.269   16.886   61.306  1.00  35.22
 150  N    SER   20    -5.514   16.727   61.381  1.00  35.36
 151  CA   SER   20    -5.740   17.705   60.328  1.00  35.66
 152  CB   SER   20    -7.184   17.609   59.825  1.00  35.65
 153  OG   SER   20    -7.675   18.875   59.414  1.00  35.43
 154  C    SER   20    -5.480   19.098   60.891  1.00  36.12
 155  O    SER   20    -6.091   19.496   61.883  1.00  35.85
 156  N    ARG   21    -4.557   19.828   60.272  1.00  36.76
 157  CA   ARG   21    -4.238   21.186   60.711  1.00  37.41
 158  CB   ARG   21    -3.125   21.796   59.844  1.00  38.05
 159  CG   ARG   21    -1.694   21.415   60.217  1.00  39.57
 160  CD   ARG   21    -1.064   20.438   59.224  1.00  41.03
 161  NE   ARG   21    -1.805   20.360   57.971  1.00  42.01
 162  CZ   ARG   21    -1.230   20.319   56.770  1.00  42.70
 163  NH1  ARG   21      .094   20.355   56.660  1.00  43.34
 164  NH2  ARG   21    -1.972   20.225   55.672  1.00  43.27
 165  C    ARG   21    -5.488   22.053   60.579  1.00  37.21
 166  O    ARG   21    -5.701   22.983   61.354  1.00  37.49
 167  N    SER   22    -6.311   21.729   59.589  1.00  36.91
 168  CA   SER   22    -7.527   22.473   59.303  1.00  36.54
 169  CB   SER   22    -7.950   22.201   57.856  1.00  37.25
 170  OG   SER   22    -9.308   22.541   57.643  1.00  37.97
 171  C    SER   22    -8.726   22.265   60.227  1.00  35.96
 172  O    SER   22    -9.326   23.235   60.690  1.00  36.13
 173  N    SER   23    -9.076   21.012   60.496  1.00  34.79
 174  CA   SER   23   -10.240   20.720   61.330  1.00  33.65
 175  CB   SER   23   -11.163   19.756   60.584  1.00  33.79
 176  OG   SER   23   -10.514   18.517   60.357  1.00  34.00
 177  C    SER   23    -9.957   20.148   62.715  1.00  32.65
 178  O    SER   23   -10.855   20.100   63.558  1.00  32.30
 179  N    GLY   24    -8.723   19.716   62.951  1.00  31.54
 180  CA   GLY   24    -8.383   19.130   64.236  1.00  30.56
 181  C    GLY   24    -8.857   17.686   64.277  1.00  29.75
 182  O    GLY   24    -8.703   16.981   65.276  1.00  29.78
 183  N    ARG   25    -9.445   17.251   63.169  1.00  28.87
 184  CA   ARG   25    -9.963   15.896   63.019  1.00  28.00
 185  CB   ARG   25   -10.936   15.884   61.838  1.00  29.15
 186  CG   ARG   25   -11.631   14.576   61.522  1.00  30.49
 187  CD   ARG   25   -12.548   14.810   60.330  1.00  31.64
 188  NE   ARG   25   -13.153   13.592   59.806  1.00  32.91
 189  CZ   ARG   25   -13.796   13.531   58.645  1.00  32.93
 190  NH1  ARG   25   -13.913   14.617   57.892  1.00  33.35
 191  NH2  ARG   25   -14.316   12.384   58.232  1.00  33.50
 192  C    ARG   25    -8.806   14.916   62.783  1.00  27.17
 193  O    ARG   25    -7.846   15.235   62.081  1.00  26.80
 194  N    VAL   26    -8.894   13.732   63.380  1.00  25.90
 195  CA   VAL   26    -7.851   12.717   63.223  1.00  25.40
 196  CB   VAL   26    -7.943   11.640   64.341  1.00  25.46
 197  CG1  VAL   26    -7.058   10.451   64.006  1.00  25.92
 198  CG2  VAL   26    -7.524   12.233   65.675  1.00  25.82
 199  C    VAL   26    -7.960   12.007   61.877  1.00  24.87
 200  O    VAL   26    -9.054   11.824   61.353  1.00  25.20
 201  N    TYR   27    -6.819   11.621   61.313  1.00  24.40
 202  CA   TYR   27    -6.803   10.886   60.051  1.00  23.42
 203  CG   TYR   27    -5.442   12.563   58.582  1.00  24.14
 204  CD1  TYR   27    -4.434   11.985   57.803  1.00  24.61
 205  CE1  TYR   27    -3.266   12.688   57.496  1.00  24.81
 206  CD2  TYR   27    -5.245   13.858   59.055  1.00  24.39
 207  CE2  TYR   27    -4.084   14.567   58.758  1.00  25.05
 208  CZ   TYR   27    -3.100   13.981   57.978  1.00  25.21
 209  OH   TYR   27    -1.962   14.698   57.683  1.00  25.76
 210  C    TYR   27    -5.624    9.925   60.069  1.00  23.02
 211  O    TYR   27    -4.774    9.999   60.960  1.00  22.89
 212  CB   TYR   27    -6.746   11.836   58.839  1.00  23.85
 213  N    TYR   28    -5.575    9.014   59.103  1.00  21.88
 214  CA   TYR   28    -4.504    8.034   59.074  1.00  21.16
 215  CB   TYR   28    -5.095    6.632   59.223  1.00  22.27
 216  CG   TYR   28    -5.914    6.520   60.488  1.00  23.24
 217  CD1  TYR   28    -7.210    7.032   60.550  1.00  24.01
 218  CE1  TYR   28    -7.926    7.036   61.745  1.00  24.83
 219  CD2  TYR   28    -5.358    5.997   61.652  1.00  23.92
 220  CE2  TYR   28    -6.062    5.996   62.852  1.00  24.76
 221  CZ   TYR   28    -7.343    6.519   62.891  1.00  25.17
 222  OH   TYR   28    -8.019    6.553   64.087  1.00  26.30
 223  C    TYR   28    -3.632    8.138   57.837  1.00  20.21
 224  O    TYR   28    -4.118    8.323   56.725  1.00  19.01
 225  N    PHE   29    -2.332    8.017   58.065  1.00  19.58
 226  CA   PHE   29    -1.327    8.134   57.024  1.00  18.83
 227  CB   PHE   29     -.461    9.359   57.344  1.00  19.22
 228  CG   PHE   29      .776    9.488   56.506  1.00  19.30
 229  CD1  PHE   29      .691    9.778   55.149

1232  OE2  GLU  167    -3.667  -10.238   38.277  1.00  30.37
1233  C    GLU  167    -3.684   -6.816   41.945  1.00  21.79
1234  O    GLU  167    -2.642   -6.145   42.090  1.00  21.19
1235  OT   GLU  167    -3.931   -7.864   42.577  1.00  21.37
1236  CB   TYR  170     7.118   13.530   52.814  1.00  55.56
1237  CG   TYR  170     5.703   13.351   52.248  1.00  56.20
1238  CD1  TYR  170     4.952   12.203   52.521  1.00  56.47
1239  CE1  TYR  170     3.645   12.044   52.017  1.00  56.62
1240  CD2  TYR  170     5.114   14.339   51.451  1.00  56.44
1241  CE2  TYR  170     3.807   14.189   50.943  1.00  56.66
1242  CZ   TYR  170     3.081   13.043   51.231  1.00  56.70
1243  OH   TYR  170     1.793   12.904   50.754  1.00  57.13
1244  C    TYR  170     6.194   14.920   54.642  1.00  54.37
1245  O    TYR  170     5.970   13.961   55.356  1.00  54.42
1246  N    TYR  170     8.641   14.888   54.282  1.00  55.08
1247  CA   TYR  170     7.303   14.840   53.619  1.00  54.90
1248  N    SEP  171     5.893   16.138   54.270  1.00  53.40
1249  CA   SEP  171     4.911   16.155   55.342  1.00  52.46
1250  CB   SEP  171     4.976   17.400   56.248  1.00  53.58
1251  OG   SEP  171     5.418   17.088   57.603  1.00  55.25
1252  C    SEP  171     3.548   15.967   54.667  1.00  50.95
1253  O    SEP  171     3.041   16.860   53.999  1.00  50.72
1254  P    SEP  171     4.475   17.349   58.906  1.00  56.21
1255  O1P  SEP  171     5.346   17.261   60.120  1.00  56.25
1256  O2P  SEP  171     4.006   18.761   58.948  1.00  56.33
1257  O3P  SEP  171     3.279   16.493   58.749  1.00  56.44
1258  N    PRO  172     2.718   14.784   54.962  1.00  49.69
1259  CD   PRO  172     3.094   13.822   56.010  1.00  49.43
1260  CA   PRO  172     1.410   14.432   54.383  1.00  48.40
1261  CB   PRO  172     1.020   13.160   55.148  1.00  48.84
1262  CG   PRO  172     1.752   13.280   56.446  1.00  49.09
1263  C    PRO  172      .290   15.467   54.378  1.00  47.21
1264  O    PRO  172      .229   16.373   55.213  1.00  46.73
1265  N    THR  173     -.610   15.312   53.417  1.00  45.94
1266  CA   THR  173    -1.744   16.207   53.287  1.00  44.92
1267  CB   THR  173    -1.980   16.579   51.817  1.00  45.05
1268  OG1  THR  173     -.722   16.601   51.131  1.00  45.35
1269  CG2  THR  173    -2.601   17.960   51.713  1.00  45.23
1270  C    THR  173    -2.965   15.472   53.824  1.00  44.00
1271  O    THR  173    -3.048   14.246   53.740  1.00  43.67
1272  N    SEP  174    -3.901   16.219   54.394  1.00  43.11
1273  CA   SEP  174    -5.107   15.613   54.928  1.00  42.34
1274  CB   SEP  174    -5.982   16.661   55.613  1.00  41.77
1275  OG   SEP  174    -5.285   17.278   56.677  1.00  40.78
1276  C    SEP  174    -5.859   14.947   53.788  1.00  42.13
1277  O    SEP  174    -5.803   15.405   52.648  1.00  42.16
1278  P    SEP  174    -5.017   18.848   56.671  1.00  40.24
1279  O1P  SEP  174    -3.939   19.053   57.657  1.00  39.89
1280  O2P  SEP  174    -4.410   19.263   55.377  1.00  40.27
1281  O3P  SEP  174    -6.331   19.526   56.783  1.00  40.16
1282  N    PRO  175    -6.571   13.850   54.079  1.00  42.06
1283  CD   PRO  175    -6.744   13.234   55.405  1.00  41.84
1284  CA   PRO  175    -7.333   13.122   53.058  1.00  41.94
1285  CB   PRO  175    -8.043   12.029   53.862  1.00  41.89
1286  CG   PRO  175    -8.070   12.566   55.257  1.00  42.04
1287  C    PRO  175    -8.289   13.969   52.217  1.00  42.11
1288  O    PRO  175    -8.502   13.678   51.039  1.00  42.01
1289  N    SER  176    -8.863   15.011   52.812  1.00  42.14
1290  CA   SER  176    -9.777   15.887   52.081  1.00  42.14
1291  CB   SER  176   -11.199   15.310   52.073  1.00  42.12
1292  OG   SER  176   -11.820   15.434   53.340  1.00  41.96
1293  C    SER  176    -9.791   17.273   52.709  1.00  42.33
1294  O    SER  176    -9.105   17.452   53.738  1.00  42.23
1295  OT   SER  176   -10.485   18.159   52.167  1.00  42.54
1296  OH2  WAT  200     2.014    6.388   33.070  1.00  13.93
1297  OH2  WAT  201    -1.283   -2.300   28.997  1.00  15.35
1298  OH2  WAT  202    -2.992   -1.687   35.759  1.00  21.83
1299  OH2  WAT  203    -2.513   19.277   39.236  1.00  14.45
1300  OH2  WAT  204    -6.092    9.665   81.760  1.00  17.79
1301  OH2  WAT  205     1.926   17.345   34.818  1.00  15.51
1302  OH2  WAT  206    -2.638     .254   52.032  1.00  18.20
1303  OH2  WAT  207    11.754   22.376   44.104  1.00  20.03
1304  OH2  WAT  208     9.320   -4.615   42.075  1.00  21.68
1305  OH2  WAT  209     4.413   10.176   28.305  1.00  24.86
1306  OH2  WAT  210     3.500   15.584   33.508  1.00  17.17
1307  OH2  WAT  211     3.016    -.603   29.684  1.00  16.24
1308  OH2  WAT  212    -3.433    -.744   29.090  1.00  20.34
1309  OH2  WAT  213    -3.928   16.386   47.143  1.00  19.86
1310  OH2  WAT  214    -1.492   19.796   34.357  1.00  23.80
1311  OH2  WAT  215     5.749   17.029   31.175  1.00  17.04
1312  OH2  WAT  216    -7.589    4.674   47.250  1.00  24.91
1313  OH2  WAT  217     5.836   -2.876   50.964  1.00  22.06
1314  OH2  WAT  218    11.848    8.037   47.950  1.00  23.46
1315  OH2  WAT  219    12.348   13.307   37.042  1.00  20.57
1316  OH2  WAT  220    -4.113    9.169   47.403  1.00  23.76
1317  OH2  WAT  221     5.203   22.317   35.018  1.00  22.18
1318  OH2  WAT  222     9.525    6.324   30.502  1.00  26.39
1319  OH2  WAT  223   -10.609    9.339   53.354  1.00  30.61
1320  OH2  WAT  224    -3.990   10.308   49.848  1.00  21.74
1321  OH2  WAT  225     9.853   25.698   42.396  1.00  29.89
1322  OH2  WAT  226    -6.397   11.568   75.148  1.00  21.91
1323  OH2  WAT  227     7.548    3.839   49.241  1.00  24.61
1324  OH2  WAT  228    -6.083    4.855   51.274  1.00  23.65
1325  OH2  WAT  229    -6.550    7.286   50.590  1.00  23.55
1326  OH2  WAT  230      .976   11.832   28.657  1.00  18.86
1327  OH2  WAT  231    -5.723    7.029   48.100  1.00  26.00
1328  OH2  WAT  232     3.126   23.015   39.476  1.00  21.91
1329  OH2  WAT  233   -10.950   13.155   65.324  1.00  25.05
1330  OH2  WAT  234    -2.711   12.584   49.681  1.00  29.21
1331  OH2  WAT  235     7.434  -10.766   44.709  1.00  25.02
1332  OH2  WAT  236    -4.201    1.994   25.179  1.00  21.85
1333  OH2  WAT  237    -2.892   14.979   79.350  1.00  29.32
1334  OH2  WAT  238    14.239    5.232   43.399  1.00  27.32
1335  OH2  WAT  239    -3.070   -1.184   54.587  1.00  30.22
1336  OH2  WAT  240   -11.964    9.420   40.306  1.00  24.61
1337  OH2  WAT  241    -2.664   20.784   36.612  1.00  25.57
1338  OH2  WAT  242    15.679     .913   43.518  1.00  32.81
1339  OH2  WAT  243     1.248    -.885   54.817  1.00  24.39
1340  OH2  WAT  244    -8.178    4.883   76.583  1.00  26.66
1341  OH2  WAT  245    -6.075   15.256   48.229  1.00  27.33
1342  OH2  WAT  246    -4.529    1.584   33.651  1.00  26.47
1343  OH2  WAT  247    -1.823   14.652   48.096  1.00  27.21
1344  OH2  WAT  248   -10.286    7.153   39.164  1.00  24.26
1345  OH2  WAT  249     8.150    8.218   48.429  1.00  23.48
1346  OH2  WAT  250    -8.406    2.586   77.882  1.00  25.81
1347  OH2  WAT  251    13.368   -6.143   46.515  1.00  38.51
1348  OH2  WAT  252     -.009   23.620   39.303  1.00  36.19
1349  OH2  WAT  253    -9.460   14.934   66.850  1.00  34.00
1350  OH2  WAT  254    -3.751   20.245   45.512  1.00  37.11
1351  OH2  WAT  255    12.681   10.291   34.535  1.00  26.21
1352  OH2  WAT  256     6.777     .601   58.270  1.00  32.10
1353  OH2  WAT  257     8.803   20.438   32.878  1.00  28.86
1354  OH2  WAT  258      .450    6.445   65.189  1.00  30.51
1355  OH2  WAT  259   -16.448    9.387   39.020  1.00  41.90
1356  OH2  WAT  260    15.918    6.602   36.928  1.00  43.42
1357  OH2  WAT  261    -8.987    4.515   44.837  1.00  34.46
1358  OH2  WAT  262     6.985   13.991   29.119  1.00  26.21
1359  OH2  WAT  263    -7.897   -7.308   39.485  1.00  38.23
1360  OH2  WAT  264     6.735    8.681   53.846  1.00  30.04
1361  OH2  WAT  265    -7.201   10.978   28.692  1.00  40.13
1362  OH2  WAT  266   -11.277   11.811   33.774  1.00  34.24
1363  OH2  WAT  267    -5.570   16.341   27.209  1.00  27.61
1364  OH2  WAT  268    28.978   19.691   29.458  1.00  34.54
1365  OH2  WAT  269     -.394   -3.182   56.909  1.00  37.28
1366  OH2  WAT  270    -5.901   -6.275   48.732  1.00  38.98
1367  OH2  WAT  271    -5.314   13.550   50.350  1.00  28.96
1368  OH2  WAT  272    17.572    7.657   45.251  1.00  37.08
1369  OH2  WAT  273     1.725     .998   24.485  1.00  41.72
1370  OH2  WAT  274    14.327   -4.819   37.245  1.00  37.09
1371  OH2  WAT  275     6.276   11.721   57.649  1.00  47.33
1372  OH2  WAT  276   -15.076   17.669   59.792  1.00  40.97
1373  OH2  WAT  277     -.429   -2.499   53.645  1.00  33.17
1374  OH2  WAT  278     3.038   -3.057   54.799  1.00  39.54
1375  OH2  WAT  279    -3.954    9.216   72.671  1.00  39.85
1376  OH2  WAT  280   -10.200    1.398   41.849  1.00  42.94
1377  OH2  WAT  281   -10.292    3.217   52.063  1.00  43.76
1378  OH2  WAT  282   -15.225   12.071   60.987  1.00  45.57
1379  OH2  WAT  283     4.001   -4.280   52.274  1.00  29.89
1380  OH2  WAT  284    -8.055   -8.627   42.074  1.00  41.79
1381  OH2  WAT  285    -1.008   -6.151   62.404  1.00  46.22
1382  OH2  WAT  286    -9.176   10.031   51.259  1.00  36.18
1383  OH2  WAT  287    -3.447    5.807   27.025  1.00  39.54
1384  OH2  WAT  288     6.178    1.935   27.972  1.00  41.27
1385  OH2  WAT  289     -.878   11.803   51.809  1.00  31.35
1386  OH2  WAT  290     4.340     .648   60.998  1.00  32.63
1387  OH2  WAT  291     7.811   -2.038   52.827  1.00  36.03
1388  OH2  WAT  292    -8.166   18.423   67.710  1.00  38.02
1389  OH2  WAT  293   -14.604   10.287   35.634  1.00  45.62
1390  OH2  WAT  294   -10.474   20.227   53.859  1.00  46.74
1391  OH2  WAT  295    17.892   23.932   22.437  1.00  42.17
1392  OH2  WAT  296    17.145   17.710   24.729  1.00  47.54
1393  OH2  WAT  297     1.480   20.841   46.728  1.00  36.92
1394  OH2  WAT  298   -10.284    6.137   64.005  1.00  46.42
1395  OH2  WAT  299     3.682    3.530   62.317  1.00  44.53
1396 


OH2 


WAT 


300 


-2 . 017 


-8 . 476 


51. 591 


1.00 


45 .29 


1397 


OH2 


WAT 


301 


- .106 


11.221 


83.869 


1.00 


37 . 74 


1398 


OH2 


WAT 


302 


-8.555 


9.470 


27.344 


1.00 


50.52 


1399 


OH2 


WAT 


303 


6.694 


27 . 979 


40 .780 


1.00 


40.54 


1400 


OH2 


WAT 


304 


-9.496 


-4 .420 


47 .793 


1.00 


40.89 


1401 


OH2 


WAT 


305 


-15 .396 


11.041 


56 .150 


1.00 


44 .20 


1402 


OH2 


WAT 


306 


-5.928 


-11.415 


43 .389 


1.00 


48 .32 


1403 


OH2 


WAT 


307 


-10 .479 


-7.663 


44.866 


1.00 


52 . 88 


1404 


OH2 


WAT 


308 


14.262 


4 . 179 


49.470 


1.00 


46. 97 


1405 


OH2 


WAT 


309 


- . 041 


15 . 184 


62 .705 


1.00 


40.11 


1406 


OH2 


WAT 


310 


15 .939 


3 . 184 


44 . 811 


1.00 


44 .49 


1407 


OH2 


WAT 


311 


3 . 817 


-.929 


63 . 168 


1.00 


42 . 13 


1408 


OH2 


WAT 


312 


-10 . 174 


-.055 


47.822 


1.00 


44.73 


1409 


OH2 


WAT 


313 


13 . 924 


- . 156 


50.669 


1.00 


43 .49 


1410 


OH2 


WAT 


314 


4 . 943 


21.521 


46. 064 


1.00 


33 .15 


1411 


OH2 


WAT 


315 


-4 . 990 


11.627 


72 . 769 


1.00 


42 .40 


1412 


OH2 


WAT 


316 


-8.733 


15.608 


48.610 


1.00 


49.94 


1413 


OH2 


WAT 


317 


-7 . 762 


17 . 938 


49.355 


1.00 


40 .62 


1414 


OH2 


WAT 


318 


-4 . 978 


- .433 


31. 726 


1. 00 


37.41 


1415 


OH2 


WAT 


319 


-4 . 689 


22 . 825 


55 . 686 


1 . 00 


47 .23 


1416 


OH2 


WAT 


320 


11. 750 


-10.644 


40.083 


1. 00 


48 .25 


1417 


OH2 


WAT 


321 


-10.564 


18 . 537 


57.762 


1. 00 


37.40 


1418 


OH2 


WAT 


322 


7 .490 


10 . 973 


25 .298 


1. 00 


45 . 18 


1419 


OH2 


WAT 


323 


2 .274 


12 . 065 


76.050 


1 . 00 


38 . 17 


1420 


OH2 


WAT 


324 


-2 . 115 


17.289 


58.628 


1.00 


41. 95 


1421 


OH2 


WAT 


325 


16.399 


18 . 887 


20 . 072 


1.00 


48 . 74 


1422 


OH2 


WAT 


326 


20. 050 


19. 008 


27 . 094 


1.00 


45 .30 


1423 


OH2 


WAT 


327 


-10 .935 


-4 .259 


42 .863 


1 . 00 


52 .28 


1424 


OH2 


WAT 


328 


-5.431 


-5.213 


36.216 


1.00 


35.64 


1425 


OH2 


WAT 


329 


-9. 928 


22 . 834 


64 .306 


1.00 


45 .77 


1426 


OH2 


WAT 


330 


-8.275 


6.285 


42 . 925 


1.00 


42 . 06 


1427 


OH2 


WAT 


331 


-13 . 049 


17 .206 


57 . 668 


1.00 


38 . 15 


1428  OH2  WAT  332    5.624  -10.159   35.555  1.00  40.21
1429  OH2  WAT  333   23.293   26.941   32.316  1.00  48.27
1430  OH2  WAT  334    1.567     .896   62.295  1.00  37.23
1431  OH2  WAT  335  -13.988   -1.533   56.782  1.00  49.02
1432  OH2  WAT  336    6.259   13.631   61.216  1.00  40.35
1433  OH2  WAT  337   -9.939   -2.809   49.680  1.00  50.64
1434  OH2  WAT  338  -10.829   10.035   61.256  1.00  39.01
1435  OH2  WAT  339   16.685   10.608   37.337  1.00  48.68
1436  OH2  WAT  340   -8.780     .857   55.000  1.00  42.72
1437  OH2  WAT  341   -8.843   -3.223   40.067  1.00  36.16
1438  OH2  WAT  342     .612   11.479   80.681  1.00  45.15
1439  OH2  WAT  343    1.599   -2.462   63.176  1.00  45.28
1440  OH2  WAT  344   18.905   21.201   43.743  1.00  46.85
1441  OH2  WAT  345    9.154  -12.805   43.973  1.00  45.21
1442  OH2  WAT  346   -8.708   18.686   55.960  1.00  41.01
1443  OH2  WAT  347   -3.987   -7.989   48.762  1.00  44.36
1444  OH2  WAT  348   -2.038   -9.706   42.264  1.00  40.86
1445  OH2  WAT  349  -11.305   11.929   49.565  1.00  45.07
1446  OH2  WAT  350    3.795    3.399   77.919  1.00  48.57
1447  OH2  WAT  351    8.628    9.813   50.645  1.00  39.75

Note to Table 1 - Coordinates from restrained individual B-factor
refinement; refinement resolution: 62.017 - 1.840 Å; starting
r= .2597; free_r= .2866; final r= .2444; free_r= .2708; B rmsd for
bonded mainchain atoms= .846; target= 1.5; B rmsd for bonded sidechain
atoms= .985; target= 2.0; B rmsd for angle mainchain atoms= 1.551;
target= 2.0; B rmsd for angle sidechain atoms= 1.623; target= 2.5;
wa= 1.17512; rweight= .530173; target= mlf; steps= 30; space group =
P2₁2₁2₁ (a= 35.270; b= 43.903; c= 124.659; α=β=γ=90°); B-correction
resolution: 6.0 - 1.840; initial B-factor correction applied to fobs:
B11= -1.655; B22= -3.389; B33= 5.044; B12= .000; B13= .000; B23= .000;
B-factor correction applied to coordinate array B: 2.789; bulk
solvent: density level= .377734 e/Å³; B-factor= 30.924 Å²; reflections
with |Fobs|/sigma_F < 0.0 rejected; reflections with |Fobs| > 10000 *
rms(Fobs) rejected; theoretical total number of reflections in
resolution range: 17526 (100.0 %); number of unobserved reflections
(no entry or |F|=0): 419 (2.4 %); number of reflections rejected: 0
(0.0 %); total number of reflections used: 17107 (97.6 %); number of
reflections in working set: 16241 (92.7 %); number of reflections in
test set: 866 (4.9 %).
CRYST1   35.270   43.903  124.659  90.00  90.00  90.00 P 21 21 21
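By way of illustration only, the reflection counts reported in the Note to Table 1 can be cross-checked with simple arithmetic. The following is a minimal sketch and is not output of the refinement software; the variable names and the helper `pct` are assumptions introduced here, and only the counts themselves are taken from the Note.

```python
# Reflection bookkeeping from the Note to Table 1 (counts copied from the Note).
total = 17526        # theoretical reflections in the resolution range
unobserved = 419     # no entry or |F| = 0
rejected = 0         # rejected by the sigma / rms cutoffs
working_set = 16241  # reflections used for refinement
test_set = 866       # reflections held out for the free R value

# Reflections actually used are those neither unobserved nor rejected.
used = total - unobserved - rejected


def pct(count: int) -> float:
    """Percentage of the theoretical total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


if __name__ == "__main__":
    print("used:", used, f"({pct(used)} %)")
    print("working + test:", working_set + test_set)
    print("test set fraction:", f"{pct(test_set)} %")
```

The working set and the test set together should account for every reflection used, which is the property the sketch checks.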



The human Pin1 numbering sequence differs from that above due to the
addition of four residues at the N-terminus from the E. coli cloning/expression vector.
Thus, one should add 4 to the native human Pin1 sequence to arrive at the numbering
scheme above. For instance, Trp11 of the original sequence is now Trp15 above.
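The renumbering rule stated above can be sketched as follows. This is an illustrative helper only; the function names and the constant `VECTOR_OFFSET` are assumptions introduced here, while the offset of 4 and the Trp11/Trp15 example come from the text.

```python
# Offset introduced by the four vector-derived N-terminal residues (from the text).
VECTOR_OFFSET = 4


def native_to_table(native_resnum: int) -> int:
    """Convert a native human Pin1 residue number to the Table 1 numbering."""
    return native_resnum + VECTOR_OFFSET


def table_to_native(table_resnum: int) -> int:
    """Convert a Table 1 residue number back to the native human Pin1 numbering."""
    return table_resnum - VECTOR_OFFSET


if __name__ == "__main__":
    # Trp11 of the native sequence corresponds to Trp15 in Table 1.
    print("Trp", native_to_table(11))
```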



In addition, in accordance with this invention, WW domains, Pin1 WW domains,
or Pin1 WW domain mutants may be crystallized in co-complex with known WW
domain binding agents, substrates, or inhibitors. The crystal structures of a series of such
complexes may then be solved by molecular replacement and compared with that of a
wild-type WW domain. Potential sites for modification within the various binding sites
of the WW domain may thus be identified. This information provides an additional tool
for determining the most efficient binding interactions, for example, increased
hydrophobic interactions, between a WW domain and a chemical entity or compound.

All of the complexes referred to above may be studied using well-known X-ray
diffraction techniques and may be refined versus 2-3 Å resolution X-ray data to an R
value of about 0.20 or less using computer software, such as X-PLOR (Yale University,
1992, distributed by Molecular Simulations, Inc.). See, e.g., Blundell & Johnson, supra;
Methods in Enzymology, vol. 114 and 115, H. W. Wyckoff et al., eds., Academic Press
(1985). This information may thus be used to optimize known classes of WW domain
binding agents (e.g., inhibitors), and to design and synthesize novel classes of WW
domain binding agents (e.g., inhibitors).

The design of binding agents that bind or otherwise associate with or inhibit a
WW domain according to the invention generally involves consideration of two factors.
First, the compound or binding agent must be capable of physically and structurally
associating with a WW domain. Non-covalent molecular interactions important in the
association of a WW domain with a substrate include hydrogen bonding, van der Waals
and hydrophobic interactions.

Second, the compound or binding agent must be able to assume a conformation
that allows it to associate with a WW domain. Although certain portions of the
compound or binding agent will not directly participate in this association, those portions
may still influence the overall conformation of the molecule. This, in turn, may have a
significant impact on potency. Such conformational requirements include the overall
three-dimensional structure and orientation of the chemical entity or compound in
relation to all or a portion of the binding site, e.g., active site or accessory binding site of
a WW domain (e.g., a Pin1 WW domain), or the spacing between functional groups of a
compound comprising several chemical entities that directly interact with a WW
domain.

The potential inhibitory or binding effect of a chemical compound on a WW
domain may be analyzed prior to its actual synthesis and testing by the use of computer
modeling techniques. If the theoretical structure of the given compound suggests
insufficient interaction and association between it and a WW domain, synthesis and
testing of the compound may be obviated. However, if computer modeling indicates a
strong interaction, the molecule may then be tested for its ability to bind to a WW
domain. Methods of assaying for WW domain activity are known in the art. Methods
for assaying the effect of a potential binding agent can be performed in the presence of a
known binding agent of a WW domain. For example, the effect of the potential binding
agent can be assayed by measuring the ability of the potential binding agent to compete
with a known binding agent.






An inhibitory or other binding compound of a WW domain may be 
computationally evaluated and designed by means of a series of steps in which chemical 
entities or fragments are screened and selected for their ability to associate with the 
individual binding pockets or other areas of a WW domain. 

One skilled in the art may use one of several methods to screen chemical entities
or fragments for their ability to associate with a WW domain and more particularly with
the individual binding pockets of the WW domain of Pin1. This process may begin by
visual inspection of, for example, the active site on the computer screen based on the
Pin1 WW domain coordinates in Table 1. Selected fragments or chemical entities may
then be positioned in a variety of orientations, or docked, within an individual binding
pocket of a WW domain. Docking may be accomplished using software such as Quanta
and Sybyl, followed by energy minimization and molecular dynamics with standard
molecular mechanics forcefields, such as CHARMM and AMBER.
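As a simple illustration of working from structure coordinates before any docking software is used, the sketch below reads rows laid out like Table 1 (serial, atom name, residue name, residue number, x, y, z, occupancy, B-factor) and lists the atoms within a chosen distance of a probe point. It is not the procedure used by Quanta or Sybyl; the `Atom` type, the function names, the probe point, and the 4.0 Å cutoff are assumptions made for the example. The three rows are water atoms copied from Table 1.

```python
import math
from typing import List, NamedTuple, Tuple


class Atom(NamedTuple):
    serial: int
    name: str
    resname: str
    resnum: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_row(row: str) -> Atom:
    """Parse one whitespace-separated coordinate row in the Table 1 layout."""
    f = row.split()
    return Atom(int(f[0]), f[1], f[2], int(f[3]), *(float(v) for v in f[4:9]))


def atoms_within(atoms: List[Atom], point: Tuple[float, float, float],
                 cutoff: float) -> List[Atom]:
    """Return the atoms whose distance to `point` is at most `cutoff` (in Å)."""
    return [a for a in atoms if math.dist((a.x, a.y, a.z), point) <= cutoff]


# Three water rows taken from Table 1.
ROWS = [
    "1433 OH2 WAT 337 -9.939 -2.809 49.680 1.00 50.64",
    "1436 OH2 WAT 340 -8.780 .857 55.000 1.00 42.72",
    "1437 OH2 WAT 341 -8.843 -3.223 40.067 1.00 36.16",
]

atoms = [parse_row(r) for r in ROWS]
# Hypothetical probe position, e.g. a candidate fragment atom.
near = atoms_within(atoms, (-9.0, -3.0, 49.0), 4.0)

if __name__ == "__main__":
    for atom in near:
        print(atom.serial, atom.resname, atom.resnum)
```

A listing of this kind is only a starting point for the visual inspection and docking steps described above.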

Specialized computer programs may also assist in the process of selecting
fragments or chemical entities. These include:

1. GRID (Goodford, P. J., "A Computational Procedure for Determining 
Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. 
Med. Chem., 28, pp. 849-857 (1985)). GRID is available from Oxford University, 
Oxford, UK. 

2. MCSS (Miranker, A. and M. Karplus, "Functionality Maps of Binding Sites:
A Multiple Copy Simultaneous Search Method." Proteins: Structure, Function and
Genetics, 11, pp. 29-34 (1991)). MCSS is available from Molecular Simulations,
Burlington, Mass.

3. AUTODOCK (Goodsell, D. S. and A. J. Olson, "Automated Docking of
Substrates to Proteins by Simulated Annealing", Proteins: Structure, Function, and
Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available from Scripps Research
Institute, La Jolla, Calif.

4. DOCK (Kuntz, I. D. et al., "A Geometric Approach to Macromolecule-Ligand
Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). DOCK is available from
University of California, San Francisco, Calif.

Once suitable chemical entities or fragments have been selected, they can be
assembled into a single compound or binding agent (e.g., an inhibitor). Assembly may
be performed by visual inspection of the relationship of the fragments to each other on
the three-dimensional image displayed on a computer screen in relation to the structure
coordinates of the Pin1 WW domain as set forth in Table 1. This would be followed by
manual model building using software such as Quanta or Sybyl.

Useful programs to aid one of skill in the art in connecting the individual
chemical entities or fragments include:

1. CAVEAT (Bartlett, P. A. et al., "CAVEAT: A Program to Facilitate the
Structure-Derived Design of Biologically Active Molecules", in "Molecular Recognition
in Chemical and Biological Problems", Special Pub., Royal Chem. Soc., 78, pp. 182-196
(1989)). CAVEAT is available from the University of California, Berkeley, Calif.

2. 3D Database systems such as MACCS-3D (MDL Information Systems, San
Leandro, Calif.). This area is reviewed in Martin, Y. C., "3D Database Searching in
Drug Design", J. Med. Chem., 35, pp. 2145-2154 (1992).

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 

In addition to the method of building or identifying a WW domain binding agent
in a step-wise fashion one fragment or chemical entity at a time as described above,
inhibitory or other WW domain binding compounds may be designed as a whole or "de
novo" using either an empty active site or optionally including some portion(s) of a
known inhibitor(s). These methods include:

1. LUDI (Bohm, H.-J., "The Computer Program LUDI: A New Method for the
De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 61-78
(1992)). LUDI is available from Biosym Technologies, San Diego, Calif.

2. LEGEND (Nishibata, Y. and A. Itai, Tetrahedron, 47, p. 8985 (1991)).
LEGEND is available from Molecular Simulations, Burlington, Mass.

3. LeapFrog (available from Tripos Associates, St. Louis, Mo.). 

Other molecular modeling techniques may also be employed in accordance with
this invention. See, e.g., Cohen, N. C. et al., "Molecular Modeling Software and
Methods for Medicinal Chemistry", J. Med. Chem., 33, pp. 883-894 (1990). See also,
Navia, M. A. and M. A. Murcko, "The Use of Structural Information in Drug Design",
Current Opinion in Structural Biology, 2, pp. 202-210 (1992).

Once a compound or binding agent has been designed or selected by the above
methods, the efficiency with which that compound may bind to a WW domain may be
tested and optimized by computational evaluation.

A compound designed or selected as a WW domain binding agent may be
further computationally optimized so that in its bound state it would preferably lack
repulsive electrostatic interaction with the target site. Such non-complementary (e.g.,
electrostatic) interactions include repulsive charge-charge, dipole-dipole and
charge-dipole interactions. Specifically, the sum of all electrostatic interactions between the
binding agent and the WW domain, when the binding agent is bound to the WW domain,
preferably makes a neutral or favorable contribution to the enthalpy of binding.
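The sum of electrostatic interactions referred to above can be illustrated with a pairwise Coulomb sum over point charges. This is a minimal sketch under stated assumptions, not the treatment used by any of the programs named in this specification: atoms are reduced to fixed partial charges, a single uniform dielectric constant is assumed (the default of 4.0 is arbitrary), and the charges and coordinates in the example are hypothetical. The conversion constant of approximately 332.06 gives energies in kcal/mol when charges are in elementary units and distances in Å.

```python
import math
from typing import Sequence, Tuple

# A point charge: (partial charge in units of e, (x, y, z) position in Å).
Charge = Tuple[float, Tuple[float, float, float]]

# Coulomb constant in kcal·Å/(mol·e^2).
COULOMB_KCAL = 332.0636


def electrostatic_energy(agent: Sequence[Charge], domain: Sequence[Charge],
                         dielectric: float = 4.0) -> float:
    """Sum q_i*q_j/(eps*r_ij) over all agent-domain charge pairs (kcal/mol)."""
    total = 0.0
    for q_a, pos_a in agent:
        for q_d, pos_d in domain:
            r = math.dist(pos_a, pos_d)
            total += COULOMB_KCAL * q_a * q_d / (dielectric * r)
    return total


if __name__ == "__main__":
    # Hypothetical example: a negative group on the agent near a positive group
    # on the domain (attractive), versus near a negative group (repulsive).
    agent = [(-1.0, (0.0, 0.0, 0.0))]
    print(electrostatic_energy(agent, [(+1.0, (3.0, 0.0, 0.0))]))
    print(electrostatic_energy(agent, [(-1.0, (3.0, 0.0, 0.0))]))
```

A negative total corresponds to a net favorable electrostatic contribution and a positive total to a net repulsive one.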

Specific computer software is available in the art to evaluate compound
deformation energy and electrostatic interaction. Examples of programs designed for
such uses include: Gaussian 92, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa.,
1992); AMBER, version 4.0 (P. A. Kollman, University of California at San Francisco,
1994); QUANTA/CHARMM (Molecular Simulations, Inc., Burlington, Mass. 1994);
and Insight II/Discover (Biosym Technologies Inc., San Diego, Calif., 1994). These
programs may be implemented, for example, using a Silicon Graphics workstation, IRIS
4D/35 or IBM RISC/6000 workstation model 550. Other hardware systems and software
packages, the speed and capacity of which are continually modified, will be known to
those skilled in the art.

Once a WW domain binding agent has been selected or designed, as described
above, substitutions may then be made in some of its atoms or side groups in order to
improve or modify its binding properties. Generally, initial substitutions are
conservative, e.g., the replacement group will have approximately the same size, shape,
hydrophobicity and charge as the original group. Such substituted chemical compounds
may then be analyzed for efficiency of fit to a WW domain by the same computer
methods described above.

A further aspect of the invention encompasses methods of treatment using
inhibitors or binding agents of a WW domain. WW domain activity has been
associated with a number of diseases including, for example, hypertension and
muscular dystrophy (Staub and Rotin, Structure 4(5):495-9, 1996); and breast cancer
(Bednarek et al., Cancer Res., 60(8):2140-5, 2000).

Cell proliferative disorders contemplated for treatment using the invention
compounds and methods disclosed herein include disorders characterized by
unwanted, inappropriate or uncontrolled cell growth. Particular examples include
cancer, fibrotic disorders, non-neoplastic growths such as benign prostatic
hypertrophy, endometriosis, psoriasis, and the like. Cancers contemplated for
treatment in accordance with the present invention include both solid tumors and
hematopoietic cancers such as leukemias and lymphomas.

Solid tumors that are treatable utilizing the invention compounds and methods 
include carcinomas, sarcomas, osteomas, fibrosarcomas, chondrosarcomas, and the 
like. Specific cancers contemplated for treatment include breast cancer, brain cancer, 
lung cancer (non-small cell and small cell), colon cancer, pancreatic cancer, prostate 
cancer, gastric cancer, bladder cancer, kidney cancer, head and neck cancer, and the 
like. 

Fibrotic disorders are generally characterized by inappropriate
overproliferation of non-cancerous fibroblasts. Examples include fibromyalgia,
fibrosis (cystic, hepatic, idiopathic pulmonary, pericardial, and the like), cardiac
fibromas, fibromuscular hyperplasia, restenosis, atherosclerosis, fibromyositis, and
the like.



Invention compounds are additionally useful in inhibiting mitosis in
pathogenic organisms and are, therefore, useful for treating infectious diseases.
Particular infectious diseases treatable by the methods disclosed herein include
bacterial infections and fungal infections.

Bacterial infections contemplated for treatment using invention compounds
and methods include infections caused by both gram-positive and gram-negative
bacteria, including infections caused by Staphylococcus, Clostridium, Streptococcus,
Enterococcus, Diplococcus, Hemophilus, Neisseria, Erysipelothricosis, Listeria,
Bacillus, Salmonella, Shigella, Escherichia, Klebsiella, Enterobacter, Serratia,
Proteus, Morganella, Providencia, Yersinia, Campylobacter, Mycobacteria, and the
like. Infection by such organisms causes a wide variety of disorders including
pneumonia, diarrhea and dysentery, anthrax, rheumatic fever, toxic shock syndrome,
mastoiditis, meningitis, gonorrhea, typhoid fever, gastroenteritis, brucellosis, cholera,
bubonic plague, tetanus, tuberculosis, Lyme disease, and the like.

Fungal infections contemplated for treatment using invention compounds and
methods include systemic fungal infections, dermatophytoses and fungal infections of
the genito-urinary tract. Systemic fungal infections include those caused by
Histoplasma, Coccidioides, Cryptococcus, Blastomyces, Paracoccidioides, Candida,
Aspergillus, Nocardia, Sporothrix, Rhizopus, Absidia, Mucor, Hormodendrum,
Phialophora, Rhinosporidium, and the like. Dermatophyte infections include those
caused by Microsporum, Trichophyton, Epidermophyton, Candida, Pityrosporum, and
the like. Fungal disorders of the genito-urinary tract include infections caused by
Candida, Cryptococcus, Aspergillus, Zygomycodoides, and the like. Infection by such
organisms causes a wide variety of disorders such as ringworm, thrush, San Joaquin
fever or Valley fever, Gilchrist's disease, and the like. These infections can be
particularly serious, and even fatal, in patients with a depressed immune system such
as organ transplant recipients and persons with acquired immunodeficiency
syndrome (AIDS).

In a further aspect of the invention, invention compounds may be used as
insecticides. The compounds of the invention prevent mitosis in insect cells, and thus
can be used to control the growth and proliferation of a variety of insect pests. This
aspect of the invention has important applications in agriculture, such as in the field,
in the storage of agricultural products, and the like. Additionally, invention
compounds are useful for controlling insect populations in places inhabited by man,
such as homes, offices, and the like.

The particular invention compound(s) selected for therapeutic use as taught
herein can be administered to a subject either alone or in a pharmaceutical
composition where the compound(s) is mixed with suitable carriers or excipient(s). In
treating a subject, a therapeutically effective dose of compound (i.e., active
ingredient) is administered. A therapeutically effective dose refers to that amount of
the active ingredient that produces amelioration of symptoms or a prolongation of
survival of a subject.

Toxicity and therapeutic efficacy of a compound can be determined by
standard pharmaceutical procedures in cell culture or experimental animals. Cell
culture assays and animal studies can be used to determine the LD50 (the dose lethal to
50% of a population) and the ED50 (the dose therapeutically effective in 50% of a
population). The dose ratio between toxic and therapeutic effects is the therapeutic
index, which can be expressed as the ratio LD50/ED50. Compounds which exhibit
large therapeutic indices are preferred. The data obtained from these cell culture
assays and animal studies can be used in formulating a range of dosages suitable for
use in humans. The dosage of such compounds lies preferably within a range of
circulating concentrations that include the ED50 with little or no toxicity. The dosage
may vary within this range depending upon a variety of factors, e.g., the dosage form
employed, the route of administration utilized, the condition of the subject, and the
like.
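The therapeutic index defined above is a simple ratio and may be sketched as follows. The function name and the numerical values in the example are hypothetical and are used only to show the arithmetic; no measured LD50 or ED50 values are given in this specification.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the therapeutic index LD50/ED50 (both in the same dose units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical values: LD50 of 500 mg/kg and ED50 of 25 mg/kg.
    print(therapeutic_index(500.0, 25.0))
```

A larger ratio indicates a wider margin between the therapeutically effective dose and the lethal dose.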

For any compound used in the method of the invention, the therapeutically
effective dose can be estimated initially from cell culture assays by determining an
IC50 (i.e., the concentration of the test substance which achieves a half-maximal
inhibition of PPIase activity). A dose can then be formulated in animal models to
achieve a circulating plasma concentration range that includes the IC50 as determined
in cell culture. Such information can be used to more accurately determine useful
doses in humans. Levels in plasma may be measured, for example, by HPLC. The
exact formulation, route of administration and dosage can be chosen by the individual
physician in view of the patient's condition. (See, e.g., Fingl et al., 1975, in "The
Pharmacological Basis of Therapeutics", Ch. 1, p. 1).

It should be noted that the attending physician would know how to and when 
to terminate, interrupt, or adjust administration due to toxicity, to organ dysfunction, 
and the like. Conversely, the attending physician would also know to adjust treatment 
to higher levels if the clinical response were not adequate (precluding toxicity). The 
magnitude of an administered dose in the management of the disorder of interest will 
vary with the severity of the condition to be treated, with the route of administration, 
and the like. The severity of the condition may, for example, be evaluated, in part, by 
standard prognostic evaluation methods. Further, the dose and perhaps dose 
frequency will also vary according to the age, body weight, and response of the 
individual patient. Typically, the dose will be between about 1-10 mg/kg of body 
weight. About 1 mg to about 50 mg will be administered to a child, and between 
about 25 mg and about 1000 mg will be administered to an adult. A program 
comparable to that discussed above may be used in veterinary medicine. 
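The weight-based range stated above (about 1-10 mg/kg of body weight) translates into an absolute dose range by multiplication. The sketch below shows only that arithmetic; the function name and the 70 kg example weight are assumptions, and the result is not a dosing recommendation.

```python
from typing import Tuple


def dose_range_mg(weight_kg: float, low_mg_per_kg: float = 1.0,
                  high_mg_per_kg: float = 10.0) -> Tuple[float, float]:
    """Return the (low, high) dose in mg for a given body weight in kg."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * weight_kg, high_mg_per_kg * weight_kg)


if __name__ == "__main__":
    # Hypothetical 70 kg adult.
    low, high = dose_range_mg(70.0)
    print(f"{low:.0f} mg to {high:.0f} mg")
```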

Depending on the specific conditions being treated, such agents may be
formulated and administered systemically or locally. Techniques for formulation and
administration may be found in "Remington's Pharmaceutical Sciences," 1990, 18th
ed., Mack Publishing Co., Easton, PA. Suitable routes may include oral, rectal,
transdermal, vaginal, transmucosal, or intestinal administration; parenteral delivery,
including intramuscular, subcutaneous, intramedullary injections, as well as
intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or
intraocular injections, just to name a few.

For injection, compounds of the invention may be formulated in aqueous
solutions, preferably in physiologically compatible buffers such as Hank's solution,
Ringer's solution, or physiological saline buffer. For transmucosal administration,
penetrants appropriate to the barrier to be permeated are used in the formulation.
Such penetrants are generally known in the art.

Use of pharmaceutically acceptable carriers to formulate the compounds
herein disclosed into dosages suitable for systemic administration is within the scope
of the invention. With proper choice of carrier and suitable manufacturing practice,
the compositions of the present invention, in particular those formulated as solutions,
may be administered parenterally, such as by intravenous injection. The compounds
can be readily formulated using pharmaceutically acceptable carriers well known in
the art into dosages suitable for oral administration. Such carriers enable the
compounds of the invention to be formulated as tablets, pills, capsules, dragees,
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a subject
to be treated.

Agents intended to be administered intracellularly may be administered using
techniques well known to those of ordinary skill in the art. For example, such agents
may be encapsulated into liposomes, then administered as described above.
Liposomes are spherical lipid bilayers with aqueous interiors. All molecules present
in an aqueous solution at the time of liposome formation are incorporated into the
aqueous interior. The liposomal contents are both protected from the external
microenvironment and, because liposomes fuse with cell membranes, are efficiently
delivered into the cell cytoplasm. Delivery systems involving liposomes are
discussed in International Patent Publication No. WO 91/02805 and International
Patent Publication No. WO 91/19501, as well as U.S. Patent No. 4,880,635 to Janoff
et al. These publications and patents provide useful descriptions of techniques for
liposome drug delivery and are incorporated by reference herein in their entirety.

Pharmaceutical compositions contemplated for use in the present invention
include compositions wherein the active ingredients are contained in an amount
effective to achieve the intended purpose. Determination of an effective amount is
well within the capability of those skilled in the art, especially in light of the detailed
disclosure provided herein.

In addition to the active ingredients, these pharmaceutical compositions may
contain suitable pharmaceutically acceptable excipients and auxiliaries which
facilitate processing of the active compounds into preparations which can be used
pharmaceutically.

The pharmaceutical compositions of the present invention may be
manufactured in a manner that is itself known, e.g., by means of conventional mixing,
dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating,
entrapping, lyophilizing processes, or the like.

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of the active compounds in water-soluble form. Additionally, suspensions 
of the active compounds may be prepared as appropriate oily injection suspensions. 
Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 
synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Aqueous injection suspensions may contain compounds which increase the viscosity 
of the suspension, such as sodium carboxymethyl cellulose, sorbitol, dextran, or the 
like. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

Pharmaceutical preparations for oral use can be obtained by combining the
active compounds with solid excipient, optionally grinding the resulting mixture, and
processing the mixture of granules, after adding suitable auxiliaries, if desired, to
obtain tablets or dragee cores.

Suitable excipients are, in particular, fillers such as sugars, including lactose,
sucrose, mannitol, sorbitol, and the like; cellulose preparations such as, for example,
maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl
cellulose, hydroxypropylmethylcellulose, sodium carboxymethylcellulose,
polyvinylpyrrolidone (PVP), and the like, as well as mixtures of any two or more
thereof. If desired, disintegrating agents may be added, such as cross-linked
polyvinyl pyrrolidone, agar, alginic acid or a salt thereof such as sodium alginate, and
the like.

Dragee cores are provided with suitable coatings. For this purpose,
concentrated sugar solutions may be used, which may optionally contain gum arabic,
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide,
lacquer solutions, suitable organic solvents or solvent mixtures, and the like.
Dyestuffs or pigments may be added to the tablets or dragee coatings for
identification or to characterize different combinations of active compound doses.

Pharmaceutical preparations which can be used orally include push-fit
capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a
plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active
ingredients in admixture with filler such as lactose, binders such as starches, and/or
lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft
capsules, the active compounds may be dissolved or suspended in suitable liquids,
such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition,
stabilizers may be added.

The invention will now be described in greater detail by reference to the 
following non-limiting examples. 

EXAMPLES 

Protein purification and crystallography. His8-tagged Pin1 was expressed and
purified by Ni2+-chelation chromatography, the histidine tag was removed by thrombin
digestion, and the protein purified as described in Ranganathan et al., Cell 89:875-886,
1997. All site-directed mutants were constructed using the QuikChange (Stratagene)
protocol and purified like wild type Pin1. Crystals of the Pin1-YpSPTpSPS complex
were grown in hanging drops at 4 °C by mixing 1.0 µl of the Pin1-peptide complex with
1.0 µl of a reservoir solution containing 100 mM MOPSO-Na+ (pH 7.0), 28% (v/v) PEG
8000, 2 mM DTT. The crystals were stabilized in 100 mM MOPSO-Na+ (pH 7.0),
25% (v/v) PEG 8000, 20% (v/v) glycerol, 1 mM DTT, and rapidly frozen in a 100 K
stream of nitrogen gas. A native data set extending to 1.84 Å resolution was collected at
the Stanford Synchrotron Radiation Laboratory, beamline 9-1 (λ = 0.98 Å). Data were
processed with DENZO and scaled with SCALEPACK (Otwinowski, Z., Minor, W.
Meth. Enzymol. 276:307-326, 1997). The crystals contain one Pin1-peptide complex per
asymmetric unit and belong to space group P2₁2₁2₁ (a = 35.27 Å, b = 43.90 Å,
c = 124.66 Å, α = β = γ = 90°). The final data set with an overall Rmerge of 6.2 %
(Rmerge = 33.7 % in the highest resolution shell) is 97.3 % complete (98.2 % in the
highest resolution shell) for the resolution range of 62.02 Å to 1.84 Å. The structure of
the Pin1-YpSPTpSPS repeat complex was solved by molecular replacement with AMORE
(Navaza, J. Acta Crystallogr. A 50:157-163, 1994) using the Pin1 monomer as a starting
model (Ranganathan et al., Cell 89:875-886, 1997). The resulting model was then
positionally refined against all the data between 62.02 Å and 1.84 Å using the default
bulk solvent model in CNS with maximum likelihood targets (Brunger et al., Acta
Crystallogr. D 54:905-921, 1998). After refining the rebuilt model, water molecules were
added automatically using CNS and edited manually in O. The final positional refinement
converged to a crystallographic R factor of 24% (Rfree = 27%) for all data (17107
reflections) between 62.02 Å and 1.84 Å. This model consists of residues 1 to 38 and 51
to 163 of human Pin1, the entire phosphopeptide, and 151 water molecules. Model
quality was checked with PROCHECK (Laskowski et al., J. Appl. Crystallogr. 26:283-291,
1993). A total of 91.7% of the residues are in the most favored regions of the
Ramachandran plot, 6.8% in the additional allowed region, and 0.8% in the generously
allowed region. Glu-5 borders the generously allowed region.
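For reference, the crystallographic R factor quoted above is conventionally computed as the sum of the absolute differences between observed and calculated structure factor amplitudes divided by the sum of the observed amplitudes; Rfree is the same quantity evaluated over the test set only. The sketch below shows that conventional definition and is not the CNS implementation; it assumes the calculated amplitudes have already been scaled to the observed ones, and the amplitudes in the example are made-up numbers.

```python
from typing import Sequence


def r_factor(f_obs: Sequence[float], f_calc: Sequence[float]) -> float:
    """Conventional R = sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|)."""
    if len(f_obs) != len(f_calc):
        raise ValueError("amplitude lists must have the same length")
    denominator = sum(abs(f) for f in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    return numerator / denominator


if __name__ == "__main__":
    # Made-up amplitudes for two reflections, for illustration only.
    print(r_factor([10.0, 20.0], [9.0, 22.0]))
```

Applying the same function to the working-set reflections and to the test-set reflections separately yields the R and Rfree values, respectively.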

Binding Analysis. To determine the affinity of full length Pin1 and its isolated
PPIase and WW domains for a set of rhodamine-labeled peptides, the peptides shown in
Table 2 were obtained commercially (SynPep Corporation, Dublin, CA) and allowed to
interact with Pin1 and with its isolated PPIase and WW domains. These peptides were
derived from proteins previously shown to interact with Pin1. While the set of peptides
examined is limited, a consensus emerges from comparison of the highest affinity
interactions. Lyophilized peptides were dissolved in 25 mM HEPES-Na+ (pH 7.5),
100 mM NaCl, 1 mM DTT and stored for short periods of time at -20 °C. Peptides were
labeled with the amine reactive reagent tetramethylrhodamine-5-(and-6)-isothiocyanate
(Molecular Probes, Inc., Eugene, OR) using a 2:1 molar ratio of rhodamine to peptide in
0.1 M sodium bicarbonate (pH 9.0) at ambient temperature for 10 hr. Fluorescence data
were collected on a PTI Alphascan spectrofluorimeter (Photon Technology Instruments,
Santa Clara, CA). Dissociation equilibrium constants for Pin1-peptide complexes were
determined by measuring the change in fluorescence anisotropy of a set of peptide
solutions, each containing a constant concentration of peptide and a varied concentration
of Pin1, using the procedure described previously (Vinson et al., Biochemistry
37:10871-10880, 1998). The core binding element determined from this group of peptides
is PXpSP, where X cannot be Gly. Surprisingly, the best binding sequences differ from the
consensus Pin1-binding sequence of WFYpSPR determined using a degenerate P.Ser-Pro
anchored peptide library (Yaffe et al., Science 278:1957-1960, 1997). Furthermore,
the dissociation constants roughly segregate into two affinity groups: lower affinity
interactions, more typical of other WW domain-peptide complexes, with dissociation
constants greater than 50 µM, and higher affinity interactions with dissociation constants
less than 30 µM. Several of these peptides yielded crystalline specimens; the most well
ordered crystals obtained thus far included the doubly phosphorylated CTD peptide,
YpSPTpSPS.
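The way a dissociation constant is extracted from an anisotropy titration can be illustrated with a simple one-site binding model. This sketch is not the procedure of Vinson et al.; it assumes the labeled peptide concentration is well below the dissociation constant, so that the fraction of peptide bound is approximately [Pin1]/(Kd + [Pin1]), and it assumes the anisotropies of the free and fully bound peptide are known. The function names, the grid search, and the synthetic data are all introduced here for illustration.

```python
from typing import Iterable, Sequence


def anisotropy(protein_uM: float, kd_uM: float,
               r_free: float, r_bound: float) -> float:
    """Observed anisotropy for a one-site model with protein in excess."""
    fraction_bound = protein_uM / (kd_uM + protein_uM)
    return r_free + (r_bound - r_free) * fraction_bound


def fit_kd(conc_uM: Sequence[float], observed: Sequence[float],
           r_free: float, r_bound: float, kd_grid: Iterable[float]) -> float:
    """Return the Kd on the grid that minimizes the sum of squared residuals."""
    def sse(kd: float) -> float:
        return sum((anisotropy(c, kd, r_free, r_bound) - o) ** 2
                   for c, o in zip(conc_uM, observed))
    return min(kd_grid, key=sse)


if __name__ == "__main__":
    # Synthetic titration generated with Kd = 10 µM (not measured data).
    conc = [1.0, 5.0, 10.0, 50.0, 100.0, 500.0]
    data = [anisotropy(c, 10.0, 0.05, 0.25) for c in conc]
    print(fit_kd(conc, data, 0.05, 0.25, range(1, 101)))
```

In practice a nonlinear least-squares routine would replace the grid search, and the deviations of the data from the fitted isotherm would provide the error estimates.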



Table 2. Dissociation equilibrium constants (µM) for Pin1-peptide complexes (a) 

Labeled Peptide                           Pin1        WW          PPIase 

WFYSPFLE (Pintide) (SEQ ID NO:5)          >540        n.d.        n.d. 
WFYpSPFLE (Pintide) (SEQ ID NO:6)         17±2.0      44±9.5      86±11 
VPRpTPV (Cdc25c-T48) (SEQ ID NO:7)        4.9±1.1     7.7±3.3     n.b. 
YLGpSPI (Cdc25c-S168) (SEQ ID NO:8)       69±10       91±30       >400 
LYRpSPS (Cdc25c-S214) (SEQ ID NO:9)       47±7.3      72±15       >500 
GSSpSPV (Wee1-S123) (SEQ ID NO:10)        73±4.9      n.d.        n.d. 
PPApTPP (Myt1-T412) (SEQ ID NO:11)        12±2.1      15±4.7      n.b. 
PPGpSPP (Myt1-S416) (SEQ ID NO:12)        80±18       126±21      n.b. 
STSpTPR (Myt1-T455) (SEQ ID NO:13)        46±6.5      39±12       n.b. 
YSPTSPS (CTD) (SEQ ID NO:14)              n.b.        n.b.        n.b. 
YpSPTSPS (CTD-S2) (SEQ ID NO:15)          61±6.3      110±26      n.b. 
YSPTpSPS (CTD-S5) (SEQ ID NO:16)          30±3.9      34±5.9      >500 
YpSPTpSPS (CTD-S2/S5) (SEQ ID NO:17)      10±0.83     34±6.2      390±82 

(a) Errors are deviations from theoretical binding isotherms. All peptides are derived 
from human proteins except for Pintide. n.b. is used when there is no detectable 
binding; n.d. stands for not determined. The phosphorylation sites are indicated with 
a single letter amino acid code and numbered according to sequences appearing in 
public databases. 
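
The rough segregation into two affinity groups can be checked against the full-length Pin1 column of Table 2. The sketch below is illustrative only: the 30 µM and 50 µM cutoffs are those quoted in the text, while the "intermediate" label, the function name, and the choice to omit rows without a numeric value (>540, n.b.) are assumptions made for this example.

```python
# Full-length Pin1 column of Table 2, in µM; only rows with a numeric value.
PIN1_KD_UM = {
    "WFYpSPFLE (Pintide)": 17.0,
    "VPRpTPV (Cdc25c-T48)": 4.9,
    "YLGpSPI (Cdc25c-S168)": 69.0,
    "LYRpSPS (Cdc25c-S214)": 47.0,
    "GSSpSPV (Wee1-S123)": 73.0,
    "PPApTPP (Myt1-T412)": 12.0,
    "PPGpSPP (Myt1-S416)": 80.0,
    "STSpTPR (Myt1-T455)": 46.0,
    "YpSPTSPS (CTD-S2)": 61.0,
    "YSPTpSPS (CTD-S5)": 30.0,
    "YpSPTpSPS (CTD-S2/S5)": 10.0,
}

def affinity_group(kd_um, high_cutoff=30.0, low_cutoff=50.0):
    """Assign a peptide to an affinity group using the cutoffs from the text."""
    if kd_um < high_cutoff:
        return "higher affinity"
    if kd_um > low_cutoff:
        return "lower affinity"
    return "intermediate"  # hypothetical label for values between the cutoffs

groups = {}
for peptide, kd in PIN1_KD_UM.items():
    groups.setdefault(affinity_group(kd), []).append(peptide)

for name, members in groups.items():
    print(f"{name}: {len(members)} peptides")
```

With these cutoffs, four peptides fall below 30 µM, four lie above 50 µM, and three sit at or between the cutoffs, which is why the segregation is described as rough.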
Overall architecture. To map the recognition surface of Pin1 for the CTD of 
hyperphosphorylated Pol IIo, we next carried out structural analysis of Pin1 bound to a 
phosphorylated heptad repeat of the CTD, N-YpSPTpSPS-C. Crystals of the Pin1-YpSPTpSPS 
complex were obtained, and the structure was solved by molecular replacement 
using the published model of Pin1 (Ranganathan et al., Cell 89:875-886, 1997) and 
refined to 1.84 Å resolution. The CTD phosphopeptide resides in the cavity separating 
the PPIase domain from the WW domain. Electrostatic, hydrogen bonding, and van der 
Waals contacts to residues projecting outward from the concave surface of Pin1's WW 
domain constitute the Pin1-peptide binding interface (Fig. 1A). No contacts between the 
bound peptide and the Pin1 PPIase domain are seen in this crystal structure. 

Two conformational differences exist in the current Pin1-peptide co-crystal 
structure relative to the previously reported structure (Ranganathan et al., Cell 89:875- 
886, 1997). The first change occurs in the PPIase domain. A nearly 70° rotation of the 
β1-α1 (residues 64-80) catalytic loop results in an exposed PPIase domain active site. 
Arg-68 and Arg-69, which confer preferential binding of phosphorylated substrates to 
the Pin1 PPIase domain, now reside well outside of the proline ring binding pocket 
formed by His-59, Cys-113, Leu-122, Met-130, Phe-134, and His-157 (Figs. 1A-B). 
While a crystal packing interaction maintains this open active site architecture, the 
observed orientation of the β1-α1 catalytic loop likely reflects the inherent mobility of 
this selectivity filter when not engaged with a bound phosphopeptide. 

A second conformational change in the Pin1 WW domain results in an 
exaggerated twist in the β-sheet concurrent with a contraction of the concave WW 
domain ligand binding surface. The upper third of β2' and β3' together with the β2'/β3' 
turn moves downward over the amino terminal half of the bound CTD peptide. This 
movement positions Arg-14 and Phe-25 in van der Waals contact with Pro-3' of the 
CTD peptide and Ser-32 within hydrogen bonding distance of the backbone carbonyl 
oxygen of Thr-4' on the CTD peptide. The lower half of the β1' and β2' strands 
together with their connecting loop pivots upward, positioning Ser-16 and Arg-17 for 
hydrogen bonding to the phosphate on P.Ser-5' of the CTD peptide. The complete 
phosphate binding module of the Pin1 WW domain encompasses the side chains of Ser-16, 
Arg-17, and Tyr-23 and the backbone amide of Arg-17. A bound water molecule 
mediates the hydrogen bond between Tyr-23 and the serine phosphate. The Tyr-23/Trp-34 
aromatic pair spatially defines the pivot point for all of these structural changes. 

The aromatic rings of these residues align edge-to-face, forming an aromatic 
clamp which accommodates the backbone atoms of Thr-4' and P.Ser-5' and the ring atoms 
of Pro-6' of the CTD peptide. Finally, the Ser-7' side chain hydroxyl group forms a 
hydrogen bond with the protonated indole nitrogen of Trp-34 (Fig. 2A). Tyr-1' and 
P.Ser-2' of the CTD peptide exhibit a greater degree of flexibility, reflected by the weak 
electron density associated with both of these CTD residues. The entire peptide binds as 
an extended coil with both P.Ser-Pro peptide bonds in the trans configuration. Modeling 
of either peptide bond in the cis configuration results in significant steric perturbations at 
the ligand binding interface that would likely result in a loss of peptide binding. This 
three-dimensional view supports an extended contact surface limited to five consecutive 
ligand residues with WW domains. 

Energetic significance of protein-ligand interactions. To more fully elaborate the 
WW domain peptide-binding interface quantitatively, the affinities of a limited series of 
Pin1 mutants for the CTD peptide were measured by fluorescence anisotropy. The 
results of this analysis are summarized schematically in Figure 3. Replacement of Arg-17 
by Ala results in a 6-fold decrease in binding affinity. In R17A Pin1, the remaining 
hydrogen bonds between the phosphate on Ser-5' and the side-chain hydroxyl group of 
Ser-16, the backbone amide of Ala-17, and the water-mediated contact with the Tyr-23 
hydroxyl moiety provide an interface for low affinity binding of the phosphopeptide. 
E. nidulans Pin1 contains an Asn residue in place of Arg-17 (Fig. 4A). While neutral, the 
amide side chain at this position can act as a hydrogen bond donor to the P.Ser 
phosphate. Therefore, WW domains possessing neutral hydrogen bond donors at 
positions equivalent to 16, 17, and 23 in Pin1 may provide an energetically acceptable 
interface for P.Ser/P.Thr-Pro recognition. WW4 of the Nedd4-like HECT domain 
ubiquitin ligase WWP1 contains a conserved Thr-Arg segment homologous to Ser-16 
and Arg-17 of Pin1 (Fig. 4A). Binding analysis of isolated WW1, WW3, and WW4 of 
WWP1 demonstrates phosphorylation-dependent binding only to WW4. Moreover, 
several HECT domain ubiquitin ligase WW domains, including WW3 of Rsp5, contain an 
equivalent hydrogen-bonding motif (Thr, Asn, and Tyr; Fig. 4A). In addition to the 
group IV WW domains, a number of phosphoprotein-binding domains exist, including 
SH2, PTB, STYX, SBF, 14-3-3, and FHA domains (Plowman et al., Proc. Natl. Acad. 
Sci. USA 96:13603-13610, 1999). While structurally distinct, nearly all such domains 
utilize Arg residues for the selective recognition of pSer, pThr, and pTyr side 
chains in a manner analogous to Pin1's WW domain. 

Mutations of the other two side chain hydrogen bond donors, Ser-16 and Tyr-23, 
to Ala and Phe, respectively, each result in a 2.5-fold decrease in binding affinity. Trp-34 
constitutes one of two conserved Trp residues found in nearly all WW domains 
described to date. Together with Tyr-23, it forms an aromatic pair that organizes around 
the Thr-4' and P.Ser-5' backbone and Pro-6' of the CTD peptide. Replacement of Trp-34 by 
Phe and Ala causes a 6-fold and 18-fold reduction in binding affinity, respectively. The 
W34A mutant phenotype is consistent with the reduction of side chain volume leading to 
less efficient packing of a portion of the CTD peptide backbone and the Pro ring. All 
Pin1 homologs described to date include one additional residue in the turn linking β1' 
and β2' (Fig. 4A). This unique structural feature of group IV WW domains may 
facilitate the conformational change in the β1'/β2' turn that is necessary for the 
formation of the phosphate binding pocket. 
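
To put the fold changes reported above on an energy scale, they can be converted to approximate differences in binding free energy with the standard relation ΔΔG = RT ln(Kd,mutant / Kd,wild type). The following is a rough sketch under stated assumptions: the temperature (298.15 K) is assumed because the measurement temperature is not given here, each "N-fold decrease in binding affinity" is read as an N-fold increase in Kd, and the function and variable names are hypothetical.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def ddg_from_fold_change(fold_decrease, temperature_k=298.15):
    """Return ddG in kcal/mol for an N-fold increase in Kd.

    ddG = R * T * ln(Kd_mutant / Kd_wild_type); temperature is an assumption.
    """
    return GAS_CONSTANT_KCAL * temperature_k * math.log(fold_decrease)

# Fold decreases in affinity for the CTD peptide, as stated in the text.
fold_changes = {"R17A": 6.0, "S16A": 2.5, "Y23F": 2.5, "W34F": 6.0, "W34A": 18.0}
ddg = {mutant: ddg_from_fold_change(fold) for mutant, fold in fold_changes.items()}

for mutant, value in ddg.items():
    print(f"{mutant}: {value:.2f} kcal/mol")
```

Under these assumptions each substitution costs roughly 0.5 to 1.7 kcal/mol, so no single side chain contact dominates the binding energy.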

Surprisingly, replacement of Arg-14 or Phe-25, both of which are predicted to be 
energetically important based upon the co-crystal structure, results in enhanced binding 
affinity. Structural analysis of the R14A Pin1 mutant without a 
peptide bound to its WW domain suggests that the WW domain exists in the 
conformation observed in the current Pin1-phosphopeptide complex rather than that of 
the wild type unliganded Pin1 complex. The enhanced binding affinity observed for the 
R14A, F25L, F25V, and F25A mutants might be due in part to preferential stabilization 
of the peptide-binding WW domain conformation in the absence of ligand. 

Comparison with other WW domains. The NMR-derived structure of the 
YAP65 WW domain complexed with a PPxY-containing peptide had a structurally 
distinct binding interface from that of the CTD peptide-bound Pin1 WW domain (Fig. 
4B) (Macias et al., Nature 382:646-649, 1996). While this may reflect distinct 
mechanisms for ligand recognition by group I and group IV WW domains, the recent 
structural analysis of the dystrophin WW domain in complex with a PPxY-containing 
peptide argues against this (Huang et al., Nature, submitted (2000)). Modeling the 
peptide bound to YAP65 in a manner similar to that observed in the Pin1-CTD complex 
results in a peptide orientation superimposed on the extended binding interface observed 
for Pin1. However, the direction of the polypeptide chain is reversed. This capacity to 
bind protein ligands in a bi-directional manner is consistent with the dystrophin WW 
domain structure and is reminiscent of the alternative binding modes utilized by SH3 
domains (Kuriyan, J., Cowburn, D., Annu. Rev. Biophys. Biomol. Struct. 26:259-288, 
1997). While Pin1 and dystrophin bind their respective ligands in opposite N- to 
C-terminal directions, apart from Pin1's ability to bind phosphopeptides, the chemical 
features of the peptide-binding interface are nearly identical. 

The picture that emerges is of a rather limited WW domain contact surface that 
relies on a set of energetically modest side chain interactions, none of which is 
absolutely essential for ligand binding. Rather, the summed contributions of this module 
with ligands spanning five consecutive amino acid residues together with interactions 
with additional modular domains and longer polypeptide targets likely contribute to 
target selection in WW domain containing proteins such as Pin1. 






While the invention has been described in detail with reference to certain 
preferred embodiments thereof, it will be understood that modifications and variations 
are within the spirit and scope of that which is described and claimed. 



